(57) Abstract: The present invention relates to regulation of angiogenesis. More particularly, the present invention is directed to 
nucleic acids encoding "angiogenesis regulatory proteins and nucleic acids" which are involved in modulation of angiogenesis. The 
invention further relates to methods for identifying and using agents, including small organic molecules, antibodies, peptides, cyclic 
peptides, nucleic acids, antisense nucleic acids, RNAi, and ribozymes, that modulate angiogenesis via modulation of angiogenesis 
regulatory proteins and nucleic acids; as well as to the use of expression profiles and compositions in diagnosis and therapy of 
diseases related to angiogenesis. 

CROSS-REFERENCES TO RELATED APPLICATIONS

This application is a continuation-in-part of Application No. 10/231,956 filed
August 30, 2002, which is herein incorporated by reference for all purposes.



FIELD OF THE INVENTION 
The present invention relates to regulation of angiogenesis. More particularly, 
the present invention is directed to nucleic acids encoding "angiogenesis regulatory proteins 
and nucleic acids" which are involved in modulation of angiogenesis. The invention further 
relates to methods for identifying and using agents, including small organic molecules, 
antibodies, peptides, cyclic peptides, nucleic acids, antisense nucleic acids, RNAi, and 
ribozymes, that modulate angiogenesis via modulation of angiogenesis regulatory proteins 
and nucleic acids; as well as to the use of expression profiles and compositions in diagnosis 
and therapy of diseases related to angiogenesis. 

BACKGROUND OF THE INVENTION 
Angiogenesis is typically limited in a normal adult to the placenta, ovary,
endometrium and sites of wound healing. However, angiogenesis, or its absence, plays an
important role in the maintenance of a variety of pathological states. Some of these states are
characterized by neovascularization, e.g., cancer, diabetic retinopathy, glaucoma, and
age-related macular degeneration. More specifically, pathological states characterized by
neovascularization include lymphoma, hematologic cancers, melanoma, breast cancer, lung
cancer, prostate cancer, colon cancer, ovarian cancer, liver cancer, Kaposi's sarcoma,
metastatic disease, rheumatoid arthritis, psoriasis, and benign proliferative disorders
including hemangiomas. Others, e.g., stroke, infertility, heart disease, ulcers, delayed wound
healing, and scleroderma, are diseases of angiogenic insufficiency. Therefore, there is a need
to identify nucleic acids encoding proteins involved in the regulation of angiogenesis, to
identify, e.g., modulators of angiogenesis, as well as new therapeutic and diagnostic
applications.

BRIEF SUMMARY OF THE INVENTION

The present application identifies, for the first time, a number of proteins and
DNA molecules involved in regulation of angiogenesis, e.g., angiogenesis regulatory proteins
and DNA molecules. The invention further relates to methods for identifying and using
agents, including small organic molecules, antibodies, peptides, cyclic peptides, nucleic
acids, antisense nucleic acids, RNAi, and ribozymes, that modulate angiogenesis via
modulation of angiogenesis regulatory proteins and DNA molecules; as well as to the use of
expression profiles and compositions in diagnosis and therapy of diseases related to
insufficient or increased angiogenesis.



In one aspect, the present invention provides a method for identifying a
compound that modulates angiogenesis, the method comprising the steps of:
(i) contacting the compound with an angiogenesis regulatory nucleic acid, or
an angiogenesis regulatory polypeptide or a fragment thereof encoded by a nucleic acid,
wherein the nucleic acid hybridizes under stringent conditions to a second nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NO:3, SEQ
ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ ID NO:70,
SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID
NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID
NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID
NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID
NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID
NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID
NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID
NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID
NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID
NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID
NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID
NO:521, and SEQ ID NO:523; and (ii) determining the functional effect of the compound
upon the nucleic acid or polypeptide.

In one embodiment, the functional effect is determined in vitro. In another
embodiment, the functional effect is a physical effect. In another embodiment, the functional
effect is determined by measuring ligand or substrate binding to the polypeptide. In another
embodiment, the functional effect is a chemical effect. In another embodiment, the
functional effect is determined by measuring an enzymatic activity.

In one embodiment, the polypeptide is expressed in a eukaryotic host cell. In
a further embodiment, the functional effect is a physical effect. In another embodiment, the
functional effect is determined by measuring ligand or substrate binding to the polypeptide.
In a further embodiment, the functional effect is a chemical or phenotypic effect. In another
embodiment, the functional effect is determined by measuring an enzymatic activity. In
another embodiment, the host cell is an endothelial cell. In a further embodiment, the
functional effect is determined by measuring αvβ3 expression or haptotaxis, or chemotaxis,
or co-culture tube formation.

In one embodiment, modulation is inhibition of angiogenesis. 
In one embodiment, the polypeptide is recombinant. In another embodiment,
the nucleic acid comprises a nucleotide sequence selected from the group consisting of SEQ
ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID NO:68,
SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID
NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID
NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID
NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID
NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID
NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID
NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID
NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID
NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID
NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID
NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID
NO:519, SEQ ID NO:521, and SEQ ID NO:523.

In another embodiment, the polypeptide comprises a sequence selected from
the group consisting of SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44, SEQ ID NO:58, SEQ
ID NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID NO:82, SEQ ID NO:87,
SEQ ID NO:90, SEQ ID NO:121, SEQ ID NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ
ID NO:164, SEQ ID NO:170, SEQ ID NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID
NO:184, SEQ ID NO:203, SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID
NO:319, SEQ ID NO:325, SEQ ID NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID
NO:366, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID
NO:420, SEQ ID NO:429, SEQ ID NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID
NO:457, SEQ ID NO:463, SEQ ID NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID
NO:497, SEQ ID NO:499, SEQ ID NO:520, SEQ ID NO:522, and SEQ ID NO:524.

In one embodiment, the compound is an antibody, an antisense molecule, a 
small organic molecule, or a peptide. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates angiogenesis, the method comprising the steps of (i) contacting the 
compound with a nucleic acid, or a polypeptide or a fragment thereof encoded by a nucleic 
acid, wherein the nucleic acid hybridizes under stringent conditions to a second nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NO:3, SEQ 
ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ ID NO:70, 
SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID 
NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID 
NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID 
NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID 
NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID 
NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID 
NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID 
NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID 
NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID 
NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID 
NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID 
NO:521, and SEQ ID NO:523; (ii) determining the functional effect of the compound upon 
the nucleic acid or polypeptide; and (iii) expressing the nucleic acid or polypeptide in a cell, 
contacting the nucleic acid or polypeptide with the compound, and determining the 
phenotypic or chemical effect upon the cell. 

In another aspect, the present invention provides a method of modulating 
angiogenesis in a subject, the method comprising the step of administering to the subject a 
therapeutically effective amount of a compound identified as a modulator of angiogenesis 
using the methods described herein. In one embodiment, the subject is a human. In a further 
embodiment, the compound is an antibody, an antisense molecule, a small organic molecule, 
a peptide, or an RNAi molecule. In another embodiment, the compound inhibits 
angiogenesis. 

In another aspect, the present invention provides a method of modulating 
angiogenesis in a subject, the method comprising the step of administering to the subject a 
therapeutically effective amount of a polypeptide, the polypeptide encoded by a nucleic acid
that hybridizes under stringent conditions to a second nucleic acid comprising a nucleotide
sequence selected from the group consisting of SEQ ID NO:63, SEQ ID NO:76, SEQ ID
NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID
NO:165, SEQ ID NO:183, SEQ ID NO:202, SEQ ID NO:218, SEQ ID NO:232, SEQ ID
NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID NO:317, SEQ ID
NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:340, SEQ ID
NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:439, SEQ ID
NO:445, SEQ ID NO:456, SEQ ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID
NO:496, and SEQ ID NO:498.

In another aspect, the present invention provides a method of modulating 
angiogenesis in a subject, the method comprising the step of administering to the subject a 
therapeutically effective amount of a nucleic acid that hybridizes under stringent conditions 
to a second nucleic acid comprising a nucleotide sequence selected from the group consisting
of SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID
NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89,
SEQ ID NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163,
SEQ ID NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175,
SEQ ID NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227,
SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286,
SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318,
SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330,
SEQ ID NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384,
SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428,
SEQ ID NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ
ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID
NO:519, SEQ ID NO:521, and SEQ ID NO:523.

In another aspect, the present invention provides the nucleic acid sequences of
SEQ ID NO:165, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227,
SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286,
SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318,
SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, and SEQ ID NO:330.

In another aspect, the present invention provides the amino acid sequences of
SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID NO:319, SEQ ID NO:325, and
SEQ ID NO:331.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides a schematic diagram of an assay for modulators of
angiogenesis.

Figure 2 provides results of an experiment demonstrating the effect of a
plakoglobin-GFP fusion protein expression on levels of the cell surface marker αvβ3.
Plakoglobin is an exemplary angiogenesis regulatory protein. Human umbilical vein
endothelial (HUVEC) cells were transfected with a vector expressing the plakoglobin-GFP
fusion protein or a control vector, expressing GFP only. Cells were incubated with
APC-labeled antibodies directed against the cell surface marker αvβ3. The X-axis depicts cell
number and the Y-axis depicts the amount of αvβ3-APC antibody derived fluorescence.
Cells transfected with the plakoglobin-GFP fusion protein construct exhibit lower αvβ3
expression levels than control cells.

Figure 3 also provides results of an experiment demonstrating the effect of a
plakoglobin-GFP fusion protein (i.e., GFP-ΔN-plakoglobin) expression on levels of the cell
surface marker αvβ3. The figure also depicts the presence of armadillo repeats in the
functional screen hit, GFP-ΔN-plakoglobin.

Figure 4 demonstrates that GFP-ΔN-Plako expression downregulates
vitronectin receptors (αvβ3 and αvβ5) but not the fibronectin receptor (α5β1).

Figure 5 demonstrates that expression of GFP-ΔN-Plakoglobin downregulates
endogenous catenin levels, as well as expression of endogenous plakoglobin and
VE-cadherin.

Figure 6 demonstrates that GFP-ΔN-Plakoglobin expression upregulates
surface levels of the laminin receptor, α6β4.

Figure 7 provides results of an experiment demonstrating the effect
of a HoxB2-GFP fusion protein expression (i.e., ΔN-HoxB2) on levels of the cell surface
marker αvβ3. Cells transfected with the ΔN-HoxB2-GFP fusion protein construct exhibit
lower αvβ3 expression levels than control cells. The figure also provides the structure of the
functional screen hit, demonstrating that the homeodomain is present in ΔN-HoxB2.

Figure 8 demonstrates that ΔN-HoxB2 downregulates both αv and β3 integrin
subunit surface expression, as well as αvβ5 surface expression.

Figure 9 demonstrates that ΔN-HoxB2 inhibits proliferation in HUVECs, a
primary cell line.

Figure 10 demonstrates that full length HoxB2 phenocopies ΔN-HoxB2 in the
downregulation of αvβ3.

Figure 11 provides results of an experiment demonstrating the effect
of a SUSP-1 GFP fusion protein expression (i.e., GFP-SUSP-1) on levels of the cell surface
marker αvβ3. Cells transfected with the GFP-SUSP-1 fusion protein construct exhibit lower
αvβ3 expression levels than control cells. The figure also provides the structure of the
functional screen hit, demonstrating that GFP-SUSP-1 is an N-terminal fusion in the sense
orientation and that a cDNA insert is present before the wild-type start codon.

Figure 12 demonstrates that the GFP-SUSP-1 screening hit does not affect 
proliferation when expressed in HUVEC (human umbilical vein endothelial cells), PASMC 
(pulmonary artery smooth muscle cells), and NHDF (normal human dermal fibroblasts). 

Figure 13 provides expression analysis of SUSP-1 mRNA in a variety of cell
lines.

Figure 14 demonstrates that SUSP-1 siRNAs inhibit SUSP-1 mRNA
expression.

Figure 15 provides a new sequence determination for a nucleic acid encoding
an ABC transporter protein.

Figure 16 provides a new sequence determination for a nucleic acid encoding
an HSPA5 protein.

Figure 17 provides a new sequence determination for a nucleic acid encoding
a protein disulfide isomerase.

Figure 18 provides a new sequence determination for a nucleic acid encoding
a novel protein from human chromosome 1.

Figure 19 provides a new sequence determination for a nucleic acid encoding
a novel protein from human chromosome 3.

Figure 20 provides a new sequence determination for a nucleic acid encoding
a novel protein from human chromosome 17.

Figure 21 provides a new sequence determination for a nucleic acid encoding
a novel protein from human chromosome 8.

Figure 22 provides a new sequence determination for a nucleic acid encoding 
a novel protein from human chromosome 9. 

Figure 23 provides a new sequence determination for a nucleic acid encoding
a 1/226 protein.

Figure 24 provides a new sequence determination for a nucleic acid encoding
an FLJ10688 protein.

Figure 25 provides a new sequence determination for a nucleic acid encoding
a KIAA1583 protein.

Figure 26 provides a new sequence determination for a nucleic acid encoding
a KIAA1814 protein.

Figure 27 provides a new sequence determination for a nucleic acid encoding
a novel protein from human chromosome 4.

Figure 28 provides a new sequence determination for a nucleic acid encoding
a peroxidasin/melanoma antigen related protein.

Figure 29 provides a new sequence determination for a nucleic acid encoding
a WD40/SOCS box protein.

Figure 30 provides results of an experiment demonstrating the effect
of a Δ5FADS antisense screening hit (i.e., GFP-Δ5FADS or D5ADS) on levels of the cell
surface marker αvβ3. Cells transfected with the GFP-Δ5FADS antisense construct exhibit
lower αvβ3 expression levels than control cells. The figure also provides the structure of the
functional screen hit, demonstrating that GFP-Δ5FADS is in the antisense orientation and
that the antisense portion of the Δ5FADS nucleic acid is about 300 base pairs long.

Figure 31 demonstrates that the GFP-D5FADS antisense screening hit does 
not affect cell proliferation. 

Figure 32 demonstrates that Δ5FADS siRNAs inhibit Δ5FADS mRNA
expression.

Figure 33 demonstrates that Δ5FADS siRNAs inhibit haptotaxis.

Figure 34 demonstrates that Δ5FADS siRNAs inhibit SDF-1 induced
chemotaxis in Jurkat cells.

Figure 35 demonstrates that Δ6FADS is a functional screening hit. The figure
shows that Δ6FADS transcript is spliced to the KIAA1583 functional screen hit.

Figure 36 demonstrates that Δ6FADS mRNA levels are strongly reduced in
cells that express the antisense KIAA1583 functional screen hit.

Figure 37 demonstrates that Δ6FADS siRNAs inhibit Δ6FADS mRNA expression and also
reduce αvβ3 surface expression.

DETAILED DESCRIPTION OF THE INVENTION 

Introduction 

Using a functional assay, angiogenesis regulatory proteins and nucleic acids
have been identified and cloned from a green fluorescent protein (GFP) fusion library.
Primary endothelial cells were transduced with retroviral cDNA/GFP fusion libraries.
Transduced cells were selected and assayed for alterations consistent with inhibition of
angiogenesis, e.g., downregulation of cell surface expression of αvβ3 or downregulation of
haptotaxis.

Using the screen described above, fifty-two proteins and/or DNA molecules 
were shown for the first time to regulate angiogenesis. Thirty-four of the proteins and/or 
DNA molecules are known, while fifteen of the proteins and/or DNA molecules are 
completely novel. However, this is the first demonstration of a function in regulation of 
angiogenesis for these molecules. 

The ABC transporter is involved in peroxisome biogenesis. Relevant
sequence data for the protein, nucleic acids encoding the ABC transporter, and related
sequences include the nucleic acid accession number NM_000033, SEQ ID NO:3; protein
accession number NP_000024, SEQ ID NO:4; a new sequence determination, SEQ ID NO:2;
and the functional screen hit, SEQ ID NO:1. The protein has an ATP-binding domain. The
gene encoding the ABC transporter is linked to a genetic disease, adrenoleukodystrophy
(Mosser et al., Nature 361:682-3 (1993)). An antisense version of the ABC transporter gene,
e.g., SEQ ID NO:1, was identified as a functional hit in the screen for modulators of
angiogenesis.

PCB1, a poly(rC) binding protein, functions as a translational co-activator and
also binds 3' UTRs of RNA, most likely through KH domains, thereby regulating RNA
stability. Relevant sequence data for the protein, nucleic acids encoding PCB1, and related
sequences include the nucleic acid accession number NM_006196, SEQ ID NO:32; protein
accession number NP_006187, SEQ ID NO:33; and the functional screen hit, SEQ ID
NO:5-31. An antisense version of the gene encoding PCB1 was identified as a functional hit
in the screen for modulators of angiogenesis.

SLC1A5, an amino acid transporter, has ten transmembrane spanning
segments. Relevant sequence data for the protein, nucleic acids encoding SLC1A5, and related
sequences include the nucleic acid accession number NM_005628, SEQ ID NO:43; protein
accession number NP_005619, SEQ ID NO:44; and the functional screen hit, SEQ ID
NO:34-42. An antisense version of the gene encoding SLC1A5 was identified as a functional
hit in the screen for modulators of angiogenesis.

Chromobox homolog 6 is involved in chromatin regulation. Relevant 
sequence data for the protein, nucleic acids encoding chromobox homolog 6, and related 
sequences include the nucleic acid accession number NM_014292, SEQ ID NO:57; protein 
accession number NP_055107, SEQ ID NO:58; and the functional screen hit, SEQ ID 
NO:44-56. An antisense version of chromobox homolog 6 was identified as a functional hit 
in the screen for modulators of angiogenesis. 

Cytochrome C oxidase subunit 1 is encoded by a mitochondrial gene and
transfers electrons from cytochrome C to O2 (Ingman et al., Nature 408:708-713 (2000);
Lehninger, Principles of Biochemistry (1984); Stryer, Biochemistry (1995)). Relevant
sequence data for the protein, nucleic acids encoding cytochrome C oxidase, and related
sequences include the nucleic acid accession number NC_001807, SEQ ID NO:63; protein
accession number NP_536845, SEQ ID NO:64; and the functional screen hit, SEQ ID
NO:59-62. A sense version of the gene encoding cytochrome C oxidase, encoding a 21
amino acid peptide, was identified as a functional hit in the screen for modulators of
angiogenesis.

Δ-5 fatty acid desaturase (FADSD5 or Δ5FADS) introduces double bonds into
a fatty acyl chain. The protein is involved in arachidonic acid synthesis and has a fatty acid
desaturase domain and a heme-binding domain. Relevant sequence data for the protein,
nucleic acids encoding Δ-5 fatty acid desaturase, and related sequences include the nucleic
acid accession number NM_013402, SEQ ID NO:68; protein accession number NP_037534,
SEQ ID NO:69; and the functional screen hit, SEQ ID NO:65-67. An antisense version of the
Δ-5 fatty acid desaturase gene was identified as a functional hit in the screen for modulators of
angiogenesis.

The dynactin p27 subunit protein was shown to be an angiogenesis regulatory 
protein. The protein has an RGD domain. Relevant sequence data for the protein, and the 
nucleic acid encoding dynactin p27 subunit protein, include the nucleic acid accession 
number NC_006571, SEQ ID NO:70; and protein accession number NP_006562.1, SEQ ID
NO:71. An antisense version of the gene encoding dynactin p27 subunit protein was
identified as a functional hit in the screen for modulators of angiogenesis. 

Elongation factor 1 alpha (EF1α) was shown to be involved in regulation of
angiogenesis. Relevant sequence data for the protein, nucleic acids encoding EF1α, and
related sequences include the nucleic acid accession number NC_001402, SEQ ID NO:76;
protein accession number NP_001393, SEQ ID NO:77; and the functional screen hit, SEQ ID
NO:72-75.

Elongation factor 4 gamma (eIF4γ) is a translation elongation factor. The
protein also has a guanine nucleotide binding domain and a W2 domain. Relevant sequence
data for the protein, nucleic acids encoding eIF4γ, and related sequences
include the nucleic acid accession number NM_001418, SEQ ID NO:81; protein accession
number NP_001409, SEQ ID NO:82; and the functional screen hit, SEQ ID NO:78-80. An
antisense version of the eIF4γ gene was identified as a functional hit in the screen for
modulators of angiogenesis. The antisense hit encodes a 21 amino acid peptide that may also
be useful as a therapeutic or to modulate angiogenesis.

The filamin A protein binds integrin and regulates actin polymerization. 
Relevant sequence data for the protein, nucleic acids encoding filamin A, and related 
sequences include the nucleic acid accession number NM_001456, SEQ ID NO:86; protein
accession number NP_001447, SEQ ID NO:87; and the functional screen hit, SEQ ID
NO:83-85. A sense version of the filamin A gene was identified as a functional hit in the
screen for modulators of angiogenesis.

Inactivation of the gene encoding the Gα13 protein causes a severe vascular
phenotype in mice due to an inhibition of angiogenesis (vasculogenesis was apparent). This
protein couples to the thrombin receptor. Thrombin and Gα13 protein activate Rho and
tyrosine-phosphorylation of focal adhesion proteins FAK, Paxillin, and p130Cas. Gα13 also
interacts with the cytoplasmic domain of cadherins, leading to increased levels of β-catenin
and transcriptional activation. Relevant sequence data for the protein, the nucleic acid
encoding the Gα13 protein, and related sequences include the nucleic acid accession number
NM_006572, SEQ ID NO:89; protein accession number NP_006563.1, SEQ ID NO:90; and
the functional screen hit, SEQ ID NO:88. A sense version of the Gα13 gene was identified as
a functional hit in the screen for modulators of angiogenesis.

The HOXB2 protein is a transcription factor. Relevant sequence data for the 
protein, nucleic acids encoding HOXB2, and related sequences include the nucleic acid 
accession number NM_002145, SEQ ID NO:120; protein accession number NP_002136,
SEQ ID NO:121; and the functional screen hit, SEQ ID NO:91-119. A sense version of the
HOXB2 gene was identified as a functional hit in the screen for modulators of angiogenesis.

HSPA5, a member of the HSP70 family of proteins, has a role in protein
folding and assembly in the endoplasmic reticulum. Expression of the protein is glucose
regulated. The protein has an ATPase domain which is fused in frame to GFP in the
functional screen hit. Relevant sequence data for the protein, nucleic acids encoding HSPA5,
and related sequences include the nucleic acid accession number NM_005347, SEQ ID
NO:128; protein accession number NP_005338, SEQ ID NO:129; a new sequence
determination, SEQ ID NO:127; and the functional screen hit, SEQ ID NO:122-126.

Interferon gamma receptor 1 (IFNγ1) is a cytokine receptor that induces
antiangiogenic genes. Relevant sequence data for the protein, nucleic acids encoding IFNγ1,
and related sequences include the nucleic acid accession number NM_000416, SEQ ID
NO:139; protein accession number NP_000407, SEQ ID NO:140; and the functional screen
hit, SEQ ID NO:130-138. An antisense version of the IFNγ1 gene was identified as a
functional hit in the screen for modulators of angiogenesis.

Importin α4 (karyopherin α3) is a subunit of the nuclear localization signal
receptor. Relevant sequence data for the protein, nucleic acids encoding importin α4, and
related sequences include the nucleic acid accession number NM_002267, SEQ ID NO:153;
protein accession number NP_002258.1, SEQ ID NO:154; and the functional screen hit, SEQ
ID NO:141-152. An antisense version of the importin α4 gene was identified as a functional
hit in the screen for modulators of angiogenesis.

The lysosomal pepstatin-insensitive protease (CLN2) is encoded by a gene 
associated with juvenile neuronal ceroid lipofuscinosis disease. Relevant sequence data for 
the protein, nucleic acids encoding CLN2, and related sequences include the nucleic acid
accession number NM_000391, SEQ ID NO:163; protein accession number NP_000382,
SEQ ID NO:164; and the functional screen hit, SEQ ID NO:155-162. An antisense version
of the CLN2 gene was identified as a functional hit in the screen for modulators of 
angiogenesis. 

A novel protein disulfide isomerase was shown to have a role in angiogenesis.
Relevant sequence data for the nucleic acid encoding the novel protein disulfide isomerase
include SEQ ID NO:165. An antisense version of the gene encoding the novel protein disulfide
isomerase was identified as a functional hit in the screen for modulators of angiogenesis. The
antisense hit encodes a 45 or 32 amino acid peptide that may also be useful as a therapeutic or
to modulate angiogenesis.

Microtubule-associated protein 4 (MAP4) binds tubulin and may link
microtubules to other elements of the cytoskeleton. See, e.g., Chapin & Bulinski, J. Cell Sci.
98:27-36 (1991); and Chapin et al., Biochemistry 34:2289-2301 (1995). Four isoforms of
MAP4 are known. Relevant sequence data for the proteins, nucleic acids encoding MAP4
isoforms, and related sequences include the nucleic acid accession numbers NM_002375,
NM_030884, NM_030885, and NM_030883; SEQ ID NOs:169, 171, 173, and 175; protein
accession numbers NP_002366, NP_112146, NP_112147, and NP_112245; SEQ ID
NOs:170, 172, 174, and 176; and the functional screen hit, SEQ ID NO:166-168. An
antisense version of MAP4 was identified as a functional hit in the screen for modulators of
angiogenesis.

Neural precursor cell expressed developmentally down regulated 5 (Nedd5) is
a septin GTPase. Relevant sequence data for the protein, nucleic acids encoding Nedd5, and
related sequences include the nucleic acid accession number NM_004404, SEQ ID NO:183;
protein accession number NP_004395, SEQ ID NO:184; and the functional screen hit, SEQ
ID NO:177-182. A sense version of the Nedd5 gene was identified as a functional hit in the
screen for modulators of angiogenesis.

A novel tyrosine/serine/threonine kinase was found to be involved in
angiogenesis. The protein is also a breast cancer antigen. Relevant sequence data for the
protein, nucleic acids encoding the novel kinase, and related sequences include the nucleic
acid accession number NM_044379, SEQ ID NO:202; protein accession number XP_044379,
SEQ ID NO:203; and the functional screen hit, SEQ ID NO:185-201. A sense version of the
gene encoding the novel kinase was identified as a functional hit in the screen for modulators
of angiogenesis.

A novel protein encoded by a gene on human chromosome 1 was found to be
involved in angiogenesis. The chromosome 1 DNA sequence identified was previously
identified as intron 1 of the JAK1 gene. Relevant sequence data for the chromosome 1 DNA
sequence include a new sequence determination, SEQ ID NO:210; and the functional screen
hit, SEQ ID NO:204-209. The functional screen hit is believed to be in the antisense
orientation.

A novel protein encoded by a gene on human chromosome 3 was found to be
involved in angiogenesis. The chromosome 3 DNA sequence was previously identified as
H41 exon 2/3 and part of intron 1. Relevant sequence data for the chromosome 3 DNA
sequence include a new sequence determination, SEQ ID NO:218; and the functional screen
hit, SEQ ID NO:211-217. The functional screen hit is in the sense orientation.

A novel protein encoded by a gene on human chromosome 17 was found to be
involved in angiogenesis. Relevant sequence data for the chromosome 17 DNA sequence
include a new sequence determination, SEQ ID NO:227; and the functional screen hit, SEQ
ID NO:219-226. The functional screen hit is believed to be in the antisense orientation.

A novel protein encoded by a gene on human chromosome 8 was found to be
involved in angiogenesis. Relevant sequence data for the chromosome 8 hit include a new
sequence determination, SEQ ID NO:232; and the functional screen hit, SEQ ID NO:228-231.
The functional screen hit is believed to be in the antisense orientation or to encode a nine
amino acid peptide that may also be useful as a therapeutic or to modulate angiogenesis.

A novel protein encoded by a gene on human chromosome 9 was found to be
involved in angiogenesis. The chromosome 9 DNA sequence was previously identified as
intronic DNA from the RXRα gene. Relevant sequence data for the chromosome 9 hit include a
new sequence determination, SEQ ID NO:248; and the functional screen hit, SEQ ID
NO:233-247. The functional screen hit is believed to be in the antisense orientation.

A novel protein (1/226) that binds to PAK2 in a yeast two hybrid screen was 
found to be involved in angiogenesis. Relevant sequence data for the novel 1/226 hit include 
a new sequence determination, SEQ ID NO:274; and the functional screen hit, SEQ ID 
NO:249-273. The functional screen hit is in the sense orientation. 

Another novel protein, FLJ10688, contains fibronectin domains and binds to
STAT-6 in a yeast two hybrid screen. Relevant sequence data for the protein, nucleic acids
encoding FLJ10688, and related sequences include the nucleic acid accession number
NM_018179, SEQ ID NO:286; protein accession number NP_060649, SEQ ID NO:287; a
new sequence determination, SEQ ID NO:285; and the functional screen hit, SEQ ID
NO:275-284. The functional screen hit is in the sense orientation.

Another novel protein, KIAA0217, contains an RNA recognition motif.
Relevant sequence data for the protein, nucleic acids encoding KIAA0217, and related
sequences include the nucleic acid accession number XM_040265, SEQ ID NO:297; protein
accession number NP_040265, SEQ ID NO:298; and the functional screen hit, SEQ ID
NO:288-296. The functional screen hit is in the antisense orientation.

Another novel protein, KIAA1583, is homologous to PDZ-RGS, which
regulates Ephrin B signaling. Relevant sequence data for the protein, nucleic acids encoding
KIAA1583, and related sequences include the nucleic acid accession number AB046803,
SEQ ID NO:308; protein accession number BAB13409, SEQ ID NO:309; a new sequence
determination, SEQ ID NO:307; and the functional screen hit, SEQ ID NO:299-306. The
functional screen hit is in the antisense orientation. The KIAA1583 transcript is spliced to
the transcript of the Δ6-fatty acid desaturase enzyme, also located on chromosome 11q12.
The antisense KIAA1583 functional screen hit inhibits expression of Δ6-fatty acid
desaturase mRNA. In addition, a Δ6-fatty acid desaturase specific siRNA decreased Δ6-fatty
acid desaturase message levels and inhibited tube formation in a co-culture assay. Thus,
Δ6-fatty acid desaturase is also an angiogenesis regulatory protein and the protein and its related
nucleic acids can be used in the screening methods of the present invention. Compounds
identified in such screens can be used in the therapeutic methods described herein.
Representative Δ6-fatty acid desaturase amino acid sequence data is found at accession
numbers NP_004256, SEQ ID NO:522; and AAD31282, SEQ ID NO:524. Related nucleic
acid sequence data is found at accession numbers NM_004265, SEQ ID NO:521; and
AF134404, SEQ ID NO:523.

A novel protein, KIAA1814, was found to be involved in angiogenesis.
Relevant sequence data for the functional screen hit include the nucleic acid accession number
XM_046822, SEQ ID NO:318; protein accession number XP_046822, SEQ ID NO:319; a
new sequence determination, SEQ ID NO:317; and the functional screen hit, SEQ ID
NO:310-316. The functional screen hit is in the sense orientation.

A novel protein encoded by a gene on human chromosome 4 was found to be
involved in angiogenesis. Relevant sequence data for the functional screen hit include a new
sequence determination, SEQ ID NO:320.

Another novel protein identified as an angiogenesis regulatory protein is
related to a peroxidasin/melanoma antigen. The protein has peroxidase, vWF, Ig, and leucine
rich domains. Relevant sequence data for the protein, nucleic acids encoding the novel
protein, and related sequences include the nucleic acid accession number XM_056455, SEQ
ID NO:324; protein accession number XP_056455, SEQ ID NO:325; a new sequence
determination, SEQ ID NO:323; and the functional screen hit, SEQ ID NO:321-322. The
functional screen hit is in the sense orientation, comprising a fusion to the peroxidase domain.

Another novel protein identified herein as an angiogenesis regulatory protein
is a novel WD40/SOCS box protein. Relevant sequence data for the protein, nucleic acids
encoding the novel WD40/SOCS box protein, and related sequences include the nucleic acid
accession number NM_018639, SEQ ID NO:330; protein accession number NP_061109,
SEQ ID NO:331; a new sequence determination, SEQ ID NO:329; and the functional screen
hit, SEQ ID NO:326-328. The functional screen hit is in the antisense orientation.

A Numb-like protein contains adapter and PTB domains and is involved in
regulation of angiogenesis. Relevant sequence data for the protein, nucleic acids encoding
the Numb-like protein, and related sequences include the nucleic acid accession number
NM_004756, SEQ ID NO:340; protein accession number NP_004747, SEQ ID NO:341; and
the functional screen hit, SEQ ID NO:332-339. The functional screen hit is in the sense
orientation.

Protein p130CAS (Crk-associated substrate) was identified as an angiogenesis
regulatory protein. p130CAS is an adapter protein and has SH3 domains. (Dorssers et al.,
Mol. Endocrinol. 7:870-878 (1993); Brinkman et al., J. Natl. Cancer Inst. 92:112-120
(2000)). Relevant sequence data for the protein, nucleic acids encoding p130CAS, and
related sequences include the nucleic acid accession number NM_014567, SEQ ID NO:351;
protein accession number NP_055382, SEQ ID NO:352; and the functional screen hit, SEQ
ID NO:342-350. The orientation of this functional screen hit is antisense.

Perlecan, a basement membrane component, was identified as an angiogenesis
regulatory protein. (Nicole et al., Nat. Genet. 26(4):480-483 (2000)). Relevant sequence
data for the protein, nucleic acids encoding perlecan, and related sequences include the
nucleic acid accession number NM_005529, SEQ ID NO:365; protein accession number
NP_005520, SEQ ID NO:366; and the functional screen hit, SEQ ID NO:353-364. The
functional screen hit is in the antisense orientation.

The T cell activation protein PGR-1 was identified as an angiogenesis
regulatory protein. Relevant sequence data for the protein, nucleic acids encoding PGR-1,
and related sequences include the nucleic acid accession number NM_033296, SEQ ID
NO:377; protein accession number NP_150638, SEQ ID NO:378; and the functional screen
hit, SEQ ID NO:367-376. The functional screen hit is in the sense orientation.

Phosphoribosyl pyrophosphate synthetase associated protein 1 (PRPSAP1)
was identified as an angiogenesis regulatory protein. PRPSAP1 regulates a nucleotide
synthesis pathway. Relevant sequence data for the protein, nucleic acids encoding
PRPSAP1, and related sequences include the nucleic acid accession number NM_002766,
SEQ ID NO:384; protein accession number NP_002757, SEQ ID NO:385; and the functional
screen hit, SEQ ID NO:379-383. The functional screen hit is in the sense orientation.

Plakoglobin, a member of the catenin family, was identified as an
angiogenesis regulatory protein. Plakoglobin binds to E-cadherin, N-cadherin, and
desmoglein proteins. Relevant sequence data for the protein, nucleic acids encoding
plakoglobin, and related sequences include the nucleic acid accession numbers NM_002230,
SEQ ID NO:406, NM_021991, SEQ ID NO:408; protein accession numbers NP_002221,
SEQ ID NO:407, NP_068831, SEQ ID NO:409; and the functional screen hit, SEQ ID
NO:386-405. Two functional screen hits were identified: one in the sense orientation, fused
to GFP; and one in the antisense orientation.

Plasminogen activator inhibitor 1 (SERPINE1 or PAI-1) was identified as an
angiogenesis regulatory protein. PAI-1 is a protease inhibitor. Relevant sequence data for
the protein, nucleic acids encoding PAI-1, and related sequences include the nucleic acid
accession number NM_000602, SEQ ID NO:419; protein accession number NP_000593,
SEQ ID NO:420; and the functional screen hit, SEQ ID NO:410-418. The functional screen
hit is in the antisense orientation.

Proteosomal subunit Y, an IFN-γ regulated proteosomal subunit, was identified
as an angiogenesis regulatory protein. Relevant sequence data for proteosomal subunit Y
include the functional screen hit, SEQ ID NO:421. The functional screen hit is in the
antisense orientation.

Rap2B was identified as an angiogenesis regulatory protein. Rap2B is a
member of the RAS family of GTPases. Relevant sequence data for the protein, nucleic acids
encoding Rap2B, and related sequences include the nucleic acid accession number
XM_003032, SEQ ID NO:428; protein accession number XP_003032, SEQ ID NO:429; and
the functional screen hit, SEQ ID NO:422-427. The functional screen hit is in the antisense
orientation.

Semaphorin 3F was identified as an angiogenesis regulatory protein.
Semaphorin 3F is a ligand for NP2. Relevant sequence data for the protein, nucleic acids
encoding semaphorin 3F, and related sequences include the nucleic acid accession number
U38276, SEQ ID NO:437; protein accession number AAB18276, SEQ ID NO:438; and the
functional screen hit, SEQ ID NO:430-436. The functional screen hit is in the antisense
orientation.

SPARC was identified as an angiogenesis regulatory protein. Relevant
sequence data for the protein, nucleic acids encoding SPARC, and related sequences include
the nucleic acid accession number NM_003118, SEQ ID NO:439; and protein accession
number NP_003109, SEQ ID NO:440.

ssDNA binding protein-1 was identified as an angiogenesis regulatory protein.
Relevant sequence data for the protein, nucleic acids encoding ssDNA binding protein-1, and
related sequences include the nucleic acid accession number NM_018070, SEQ ID NO:445;
protein accession number NP_060540, SEQ ID NO:446; and the functional screen hit, SEQ
ID NO:441-444. The functional screen hit is in the sense orientation.

Sumo protease (SUSP-1) was identified as an angiogenesis regulatory protein.
Relevant sequence data for the protein, nucleic acids encoding SUSP-1, and related
sequences include the nucleic acid accession number NM_015571, SEQ ID NO:456; protein
accession number NP_056386, SEQ ID NO:457; and the functional screen hit, SEQ ID
NO:447-455. The functional screen hit is in the sense orientation and includes the 5'
untranslated region of the gene.

Pantophysin, a synaptophysin homolog, was identified as an angiogenesis
regulatory protein (Zhong et al., Biochim. Biophys. Acta 1129:235-238 (1992)). Pantophysin
was previously shown to be involved in vesicle trafficking. Relevant sequence data for the
protein, nucleic acids encoding pantophysin, and related sequences include the nucleic acid
accession number NM_006754, SEQ ID NO:462; protein accession number NP_006745,
SEQ ID NO:463; and the functional screen hit, SEQ ID NO:458-461. The functional screen
hit is in the antisense orientation.

Thrombospondin, a known regulator of angiogenesis, was also identified as a
sense hit fused to the laminin domain. Relevant sequence data for the protein, nucleic acids
encoding thrombospondin, and related sequences include the nucleic acid accession number
NM_0003246, SEQ ID NO:481; protein accession number NP_003237, SEQ ID NO:482;
and the functional screen hit, SEQ ID NO:464-480.

Transgelin 2 was identified as an angiogenesis regulatory protein. Relevant
sequence data for the protein, nucleic acids encoding transgelin 2, and related sequences
include the nucleic acid accession number NM_003564, SEQ ID NO:484; protein accession
number NP_003555, SEQ ID NO:485; and the functional screen hit, SEQ ID NO:483. The
functional screen hit is in the sense orientation.

Vigilin was identified as an angiogenesis regulatory protein. Relevant
sequence data for the protein, nucleic acids encoding vigilin, and related sequences include
the nucleic acid accession number NM_005336, SEQ ID NO:493; protein accession number
NP_005327, SEQ ID NO:494; and the functional screen hit, SEQ ID NO:486-492. The
functional screen hit is in the sense orientation.

Vimentin was identified as an angiogenesis regulatory protein. Relevant
sequence data for the protein, nucleic acids encoding vimentin, and related sequences include
the nucleic acid accession numbers NM_003380, SEQ ID NO:496, XM_167414, SEQ ID
NO:498; protein accession numbers NP_003371, SEQ ID NO:497, XP_167414, SEQ ID
NO:499; and the functional screen hit, SEQ ID NO:495.

Vinexin (SCAM-1) was identified as an angiogenesis regulatory protein.
SCAM-1 is an adaptor protein and has an SH3 domain. Relevant sequence data for the
protein, nucleic acids encoding SCAM-1, and related sequences include the nucleic acid
accession number NM_005775, SEQ ID NO:519; protein accession number NP_005766,
SEQ ID NO:520; and the functional screen hit, SEQ ID NO:500-518. The functional screen
hit is in the antisense orientation.

Some of the functional screen hits can be grouped together on the basis of
other cellular functions. For example, plakoglobin, the Gal3 protein, and p130CAS protein
all appear to function in cell adhesion. The Δ-5 fatty acid desaturase, the ALD protein (ABC
transporter), cytochrome C oxidase, and the novel peroxidasin protein all may be important in
arachidonic acid synthesis. The novel protein disulfide isomerase may be involved in
collagen biosynthesis. Pantophysin, HSPA5, and dynactin p27 may all be involved in
vesicle trafficking. Without wishing to be bound by theory, regulation of any of these
cellular functions may influence the angiogenesis pathway. In addition, without wishing to
be bound by theory, both the proteins and/or the nucleic acids identified in the assays are
regulators of angiogenesis, e.g., by antisense activity, by overexpression of a functional
protein, or as dominant negative inhibitors or small peptide inhibitors.

Angiogenesis assays described herein reveal for the first time that expression
of a nucleic acid molecule encoding the above described angiogenesis regulatory proteins
exerted a negative effect on αvβ3 surface expression. In addition, endothelial cells
expressing the sequences were strongly inhibited in their haptotactic response to vitronectin,
which is an indicator of an anti-angiogenic phenotype. Without wishing to be bound by
theory, it appears that the angiogenesis regulatory proteins or fragments of those proteins,
peptides derived from the proteins, or peptides and inhibitory DNA or RNA molecules
derived from DNA encoding the proteins, provide an anti-angiogenic phenotype. Thus, in
addition to their use in screens for modulators of angiogenesis, the angiogenesis regulatory
proteins or fragments of those proteins, peptides derived from the proteins, or peptides and
inhibitory DNA or RNA molecules derived from DNA encoding the proteins, can also be
used as therapeutics for treatment of neovascularization or for treatment of angiogenic
insufficiency.

The angiogenesis regulatory proteins, nucleic acids, and other members of the
angiogenesis pathway therefore represent targets for the development of angiogenic drugs,
preferably anti-angiogenic drugs, e.g., anti-angiogenic drugs for treatment of
neovascularization. Neovascularization can be caused by pathologic states including cancer,
diabetic retinopathy, glaucoma, age related macular degeneration, lymphoma, hematologic
cancers, melanoma, breast cancer, lung cancer, prostate cancer, colon cancer, ovarian cancer,
liver cancer, Kaposi's sarcoma, metastatic disease, rheumatoid arthritis, psoriasis, and benign
proliferative disorders including hemangiomas. The angiogenesis regulatory proteins,
nucleic acids, and other members of the angiogenesis pathway also represent targets for the
development of angiogenic drugs for treatment of angiogenic insufficiency. Stroke,
infertility, heart disease, ulcers, delayed wound healing, and scleroderma are examples of
diseases of angiogenic insufficiency. Modulators include small organic molecules, nucleic acids,
peptides, cyclic peptides, antibodies, antisense molecules, RNAi molecules, and ribozymes.
The nucleic acids and proteins of the invention are also useful for diagnostic applications,
using, e.g., nucleic acid probes, oligonucleotides, and antibodies.

Definitions 

By "disorder associated with angiogenesis" or "disease associated with
angiogenesis" herein is meant a disease state which is marked by either an excess or a deficit
of vessel development. Angiogenesis disorders associated with increased angiogenesis
include, but are not limited to, cancer and proliferative diabetic retinopathy. Pathological
states for which it may be desirable to increase angiogenesis include stroke, heart disease,
infertility, ulcers, and scleroderma. An increase in angiogenesis may also be desirable in
transplantation or for artificial or in vitro growth of organs.

The following table (Table 1) presents the angiogenesis regulatory proteins
and nucleic acids identified in a screen for such proteins and nucleic acids. Accession
numbers and SEQ ID NOs are also provided for each angiogenesis regulatory protein. While
the angiogenesis regulatory proteins and nucleic acids are referred to as SEQ ID NOs
throughout the present application, the term SEQ ID NO encompasses orthologues and
homologues of each angiogenesis regulatory protein or nucleic acid identified by a SEQ ID
NO.



TABLE 1: ANGIOGENESIS REGULATORY PROTEINS AND NUCLEIC ACIDS

Name | Nucleic Acid Accession Number | Nucleic Acid SEQ ID NO | Protein Accession Number | Protein SEQ ID NO
ABC transporter | NM_000033 | SEQ ID NO:3 | NP_000024 | SEQ ID NO:4
PCB1 | NM_006196 | SEQ ID NO:32 | NP_006187 | SEQ ID NO:33
SLC1A5 | NM_005628 | SEQ ID NO:43 | NP_005619 | SEQ ID NO:44
Chromobox homolog 6 | NM_014292 | SEQ ID NO:57 | NP_055107 | SEQ ID NO:58
Cytochrome C oxidase subunit 1 | NC_001807 | SEQ ID NO:63 | NP_536845 | SEQ ID NO:64
Δ-5 fatty acid desaturase | NM_013402 | SEQ ID NO:68 | NP_037534 | SEQ ID NO:69
dynactin p27 subunit protein | NC_006571 | SEQ ID NO:70 | NP_006562.1 | SEQ ID NO:71
Elongation factor 1 alpha | NC_001402 | SEQ ID NO:76 | NP_001393 | SEQ ID NO:77
Elongation factor 4 gamma | NM_001418 | SEQ ID NO:81 | NP_001409 | SEQ ID NO:82
filamin A protein | NM_001456 | SEQ ID NO:86 | NP_001447 | SEQ ID NO:87
Gal 3 protein | NM_006572 | SEQ ID NO:89 | NP_006563.1 | SEQ ID NO:90
HOXB2 protein | NM_002145 | SEQ ID NO:120 | NP_002136 | SEQ ID NO:121
HSPA5 | NM_005347 | SEQ ID NO:128 | NP_005338 | SEQ ID NO:129
Interferon gamma receptor 1 | NM_000416 | SEQ ID NO:139 | NP_000407 | SEQ ID NO:140
Importin α4 | NM_002267 | SEQ ID NO:153 | NP_002258.1 | SEQ ID NO:154
lysosomal pepstatin-insensitive protease (CLN2) | NM_000391 | SEQ ID NO:163 | NP_000382 | SEQ ID NO:164
novel protein disulfide isomerase | | SEQ ID NO:165 | |
Microtubule-associated protein 4 (MAP4) | NM_002375, NM_030884, NM_030885, NM_030883 | SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175 | NP_002366, NP_112146, NP_112147, NP_112245 | SEQ ID NO:170, SEQ ID NO:172, SEQ ID NO:174, SEQ ID NO:176
Nedd5 | NM_004404 | SEQ ID NO:183 | NM_004395 | SEQ ID NO:184
novel tyr/ser/thr kinase | NM_044379 | SEQ ID NO:202 | XP_044379 | SEQ ID NO:203
human chromosome 1 protein | | SEQ ID NO:210 | |
human chromosome 3 protein | | SEQ ID NO:218 | |
human chromosome 17 protein | | SEQ ID NO:227 | |
human chromosome 8 protein | | SEQ ID NO:232 | |
human chromosome 9 protein | | SEQ ID NO:248 | |
1/226 | | SEQ ID NO:274 | |
FLJ10688 | NM_018179 | SEQ ID NO:286 | NP_060649 | SEQ ID NO:287
KIAA0217 | XM_040265 | SEQ ID NO:297 | NP_040265 | SEQ ID NO:298
KIAA1583 | AB046803 | SEQ ID NO:308 | BAB13409 | SEQ ID NO:309
KIAA1814 | XM_046822 | SEQ ID NO:318 | XP_046822 | SEQ ID NO:319
human chromosome 4 protein | | SEQ ID NO:320 | |
peroxidasin-melanoma antigen related protein | XM_056455 | SEQ ID NO:324 | XP_056455 | SEQ ID NO:325
WD40/SOCS box protein | NM_018639 | SEQ ID NO:330 | NP_061109 | SEQ ID NO:331
Numb-like protein | NM_004756 | SEQ ID NO:340 | NP_004747 | SEQ ID NO:341
p130CAS | NM_014567 | SEQ ID NO:351 | NP_055382 | SEQ ID NO:352
Perlecan | NM_005529 | SEQ ID NO:365 | NP_005520 | SEQ ID NO:366
PGR-1 | NM_033296 | SEQ ID NO:377 | NP_150638 | SEQ ID NO:378
PRPSAP1 | NM_002766 | SEQ ID NO:384 | NP_002757 | SEQ ID NO:385
Plakoglobin | NM_002230, NM_021991 | SEQ ID NO:406, SEQ ID NO:408 | NP_002221, NP_068831 | SEQ ID NO:407, SEQ ID NO:409
SERPINE1 or PAI-1 | NM_000602 | SEQ ID NO:419 | NP_000593 | SEQ ID NO:420
Proteosomal subunit Y | | SEQ ID NO:421 | |
Rap2B | XM_003032 | SEQ ID NO:428 | XP_003032 | SEQ ID NO:429
Semaphorin 3F | U38276 | SEQ ID NO:437 | AAB18276 | SEQ ID NO:438
SPARC | NM_003118 | SEQ ID NO:439 | NP_003109 | SEQ ID NO:440
ssDNA binding protein-1 | NM_018070 | SEQ ID NO:445 | NP_060540 | SEQ ID NO:446
SUSP-1 | NM_015571 | SEQ ID NO:456 | NP_056386 | SEQ ID NO:457
Pantophysin | NM_006754 | SEQ ID NO:462 | NP_006745 | SEQ ID NO:463
Thrombospondin | NM_0003246 | SEQ ID NO:481 | NP_003237 | SEQ ID NO:482
Transgelin 2 | NM_003564 | SEQ ID NO:484 | NP_003555 | SEQ ID NO:485
Vigilin | NM_005336 | SEQ ID NO:493 | NP_005327 | SEQ ID NO:494
Vimentin | NM_003380, XM_167414 | SEQ ID NO:496, SEQ ID NO:498 | NP_003371, XP_167414 | SEQ ID NO:497, SEQ ID NO:499
Vinexin (SCAM-1) | NM_005775 | SEQ ID NO:519 | NP_005766 | SEQ ID NO:520
Δ-6 fatty acid desaturase | NM_004265, AF134404 | SEQ ID NO:521, SEQ ID NO:523 | NP_004256, AAD31282 | SEQ ID NO:522, SEQ ID NO:524

The terms "angiogenesis regulatory protein or nucleic acid" or a nucleic acid
encoding "angiogenesis regulatory protein" refer to nucleic acid and polypeptide
polymorphic variants, alleles, mutants, and interspecies homologs that: (1) have an amino
acid sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%,
80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater
amino acid sequence identity, preferably over a region of at least about 25,
50, 100, 200, 500, 1000, or more amino acids, to a polypeptide encoded by a nucleic acid of
SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID 
NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89,
SEQ ID NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163,
SEQ ID NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175,
SEQ ID NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, 
SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, 
SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318,
SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330,
SEQ ID NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, 
SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, 
SEQ ID NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ
ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, and SEQ
ID NO:519, or an amino acid sequence of SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44,
SEQ ID NO:58, SEQ ID NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID
NO:82, SEQ ID NO:87, SEQ ID NO:90, SEQ ID NO:121, SEQ ID NO:129, SEQ ID
NO:140, SEQ ID NO:154, SEQ ID NO:164, SEQ ID NO:170, SEQ ID NO:172, SEQ ID
NO:174, SEQ ID NO:176, SEQ ID NO:184, SEQ ID NO:203, SEQ ID NO:287, SEQ ID
NO:298, SEQ ID NO:309, SEQ ID NO:319, SEQ ID NO:325, SEQ ID NO:331, SEQ ID
NO:341, SEQ ID NO:352, SEQ ID NO:366, SEQ ID NO:378, SEQ ID NO:385, SEQ ID 
NO:407, SEQ ID NO:409, SEQ ID NO:420, SEQ ID NO:429, SEQ ID NO:438, SEQ ID 
NO:440, SEQ ID NO:446, SEQ ID NO:457, SEQ ID NO:463, SEQ ID NO:482, SEQ ID 
NO:485, SEQ ID NO:494, SEQ ID NO:497, SEQ ID NO:499, SEQ ID NO:520, SEQ ID
NO:522, and SEQ ID NO:524 or other sequences listed herein; (2) specifically bind to
antibodies, e.g., polyclonal antibodies, raised against an immunogen comprising an amino
acid sequence of SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44, SEQ ID NO:58, SEQ ID 
NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID NO:82, SEQ ID NO:87, 
SEQ ID NO:90, SEQ ID NO:121, SEQ ID NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ
ID NO:164, SEQ ID NO:170, SEQ ID NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID
NO:184, SEQ ID NO:203, SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID
NO:319, SEQ ID NO:325, SEQ ID NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID
NO:366, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID
NO:420, SEQ ID NO:429, SEQ ID NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID
NO:457, SEQ ID NO:463, SEQ ID NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID
NO:497, SEQ ID NO:499, SEQ ID NO:520, SEQ ID NO:522, and SEQ ID NO:524, 
immunogenic fragments thereof, and conservatively modified variants thereof; (3) 
specifically hybridize under stringent hybridization conditions to a nucleic acid encoding 
SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44, SEQ ID NO:58, SEQ ID NO:64, SEQ ID 
NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID NO:82, SEQ ID NO:87, SEQ ID NO:90, 
SEQ ID NO:121, SEQ ID NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ ID NO:164, 
SEQ ID NO:170, SEQ ID NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID NO:184, 
SEQ ID NO:203, SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID NO:319, 
SEQ ID NO:325, SEQ ID NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID NO:366, 
SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID NO:420,
SEQ ID NO:429, SEQ ID NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:457,
SEQ ID NO:463, SEQ ID NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID NO:497,
SEQ ID NO:499, SEQ ID NO:520, SEQ ID NO:522, and SEQ ID NO:524, e.g., a nucleic
acid sequence of SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID
NO:63, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86,
SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ
ID NO:163, SEQ ID NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID
NO:175, SEQ ID NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID
NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID
NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID
NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID
NO:330, SEQ ID NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID
NO:384, SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID
NO:428, SEQ ID NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID
NO:462, SEQ ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID
NO:498, SEQ ID NO:519, SEQ ID NO:521, and SEQ ID NO:523; or its complement, and
conservatively modified variants thereof; (4) have a nucleic acid sequence that has greater 
than about 95%, preferably greater than about 96%, 97%, 98%, 99%, or higher nucleotide 
sequence identity, preferably over a region of at least about 25, 50, 100, 200, 500, 1000, or
more nucleotides, to SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ 
ID NO:63, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, 
SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ 
ID NO:163, SEQ ID NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID 
NO:175, SEQ ID NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID
NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID 
NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID 
NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID
NO:330, SEQ ID NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID
NO:384, SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID
NO:428, SEQ ID NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID
NO:462, SEQ ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID 
NO:498, SEQ ID NO:519, SEQ ID NO:521, and SEQ ID NO:523, or the complement of any 
of those sequences. A polynucleotide or polypeptide sequence is typically from a mammal 
including, but not limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, 
horse, sheep, or any mammal. The nucleic acids and proteins of the invention include both 
naturally occurring and recombinant molecules. When available, accession numbers for the 
human angiogenesis regulatory proteins and genes are provided. 

The phrase "functional effects" in the context of assays for testing compounds 
that modulate activity of an angiogenesis regulatory protein includes the determination of a 
parameter that is indirectly or directly under the influence of an angiogenesis regulatory 
protein polypeptide, e.g., a chemical or phenotypic effect such as a loss-of-angiogenesis 
phenotype represented by a change in expression of a cell surface marker, such as αvβ3 
integrin, or changes in cellular proliferation, especially endothelial cell proliferation; or 
enzymatic activity; or, e.g., a physical effect such as ligand binding or inhibition of ligand 
binding. A functional effect therefore includes ligand binding activity, the ability of cells to 
proliferate, expression in cells undergoing angiogenesis, and other characteristics of 
angiogenic cells. "Functional effects" include in vitro, in vivo, and ex vivo activities. 
Angiogenesis assays are described, e.g., in Angiogenesis Protocols (Murray, ed., 2001). 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of an 
angiogenesis regulatory protein, e.g., measuring physical and chemical or phenotypic 
effects. Such functional effects can be measured by any means known to those skilled in the 
art, e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive 
index); hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein; 
measuring inducible markers or transcriptional activation of the angiogenesis protein; 
measuring changes in cell surface markers, e.g., αvβ3 integrin; and measuring cellular 
proliferation, particularly endothelial cell proliferation. Functional effects can be measured 
using ligand binding assays, e.g., binding to antibodies or other molecules. Functional effects 
can be determined by measuring changes in enzymatic activity. Examples of enzymatic 
activities include, but are not limited to, protein translation, electron transfer, fatty acid 
desaturase activity, protein disulfide isomerase activity, transcriptional activation activity, 
peroxidase activity, kinase activity, and proteolytic activity. 

Determination of the functional effect of a compound on angiogenesis can also 
be performed using angiogenesis assays known to those of skill in the art such as αvβ3 
downregulation, VEGF-2 upregulation, endothelial cell tube formation assays; chemotaxis 
assays, haptotaxis assays; differentiation assays using matrigel or co-culture with smooth 
muscle cells, the chick CAM assay; the mouse corneal assay; and assays that assess 
vascularization of an implanted tumor. The functional effects can be evaluated by many 
means known to those skilled in the art, e.g., microscopy for quantitative or qualitative 
measures of alterations in morphological features, e.g., tube or blood vessel formation, 
measurement of changes in RNA or protein levels for angiogenesis-associated sequences, 
measurement of RNA stability, identification of downstream or reporter gene expression 
(CAT, luciferase, β-gal, GFP and the like), e.g., via chemiluminescence, fluorescence, 
colorimetric reactions, antibody binding, inducible markers, etc. 

"Ligand" refers to a molecule that is specifically bound by a protein. An 
antibody is one example of a ligand. Other examples of ligands include RNA, cytochrome C, 
O2, NADP or NADPH, heme, MHC molecules, an amino acid sequence comprising a nuclear 
localization signal, tubulin or other cytoskeletal proteins, the thrombin receptor, transcription 
factor 7, cadherins, and ssDNA. 

"Substrate" refers to a molecule that binds to an enzyme and is part of a 
specific chemical reaction catalyzed by the enzyme. Substrates of angiogenesis regulatory 
proteins include ATP, fatty acids, GTP, O2, arachidonic acid or its precursors, collagen, and 
protein molecules or peptides. 

"Inhibitors," "activators," and "modulators" of angiogenesis regulatory protein 
polynucleotide and polypeptide sequences are used to refer to activating, inhibitory, or 
modulating molecules identified using in vitro and in vivo assays of angiogenesis regulatory 
protein polynucleotide and polypeptide sequences. Inhibitors are compounds that, e.g., bind 
to, partially or totally block activity, decrease, prevent, delay activation, inactivate, 
desensitize, or down regulate the activity or expression of angiogenesis regulatory proteins, 
e.g., antagonists. "Activators" are compounds that increase, open, activate, facilitate, 
enhance activation, sensitize, agonize, or up regulate angiogenesis regulatory protein 
activity, e.g., agonists. Inhibitors, activators, or modulators also include genetically modified 
versions of angiogenesis regulatory proteins, e.g., versions with altered activity, as well as 
naturally occurring and synthetic ligands, antagonists, agonists, antibodies, peptides, cyclic 
peptides, nucleic acids, antisense molecules, ribozymes, RNAi, small organic molecules and 
the like. Such assays for inhibitors and activators include, e.g., expressing angiogenesis 
regulatory proteins in vitro, in cells, or cell extracts, applying putative modulator compounds, 
and then determining the functional effects on activity, as described above. 

Samples or assays comprising angiogenesis regulatory proteins that are treated 
with a potential activator, inhibitor, or modulator are compared to control samples without 
the inhibitor, activator, or modulator to examine the extent of inhibition or activation. Control 
samples (untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition 
of angiogenesis regulatory protein is achieved when the activity value relative to the control 
is about 80%, preferably 50%, more preferably 25-0%. Activation of angiogenesis regulatory 
protein is achieved when the activity value relative to the control (untreated with activators) 
is 110%, more preferably 150%, more preferably 200-500% (i.e., two to five fold higher 
relative to the control), more preferably 1000-3000% higher. 
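
By way of illustration only, the comparison to a control described above can be sketched in a few lines of Python. The function names are hypothetical, activity is assumed to be measured in arbitrary but consistent units, and treating the approximate 80% and 110% values as inclusive cutoffs is an assumption of this sketch rather than part of the definition.

```python
def relative_activity(treated: float, control: float) -> float:
    """Return treated activity as a percentage of the untreated control (control = 100%)."""
    if control <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * treated / control


def classify_modulation(treated: float, control: float) -> str:
    """Classify a test compound using the approximate cutoffs given in the text."""
    pct = relative_activity(treated, control)
    if pct <= 80.0:    # about 80% of control or lower: inhibition (assumed inclusive)
        return "inhibition"
    if pct >= 110.0:   # 110% of control or higher: activation (assumed inclusive)
        return "activation"
    return "no modulation"
```

For example, a treated sample with one quarter of the control activity would be reported as inhibition, and one with 150% of the control activity as activation.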

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, either naturally occurring or synthetic, 
e.g., protein, oligopeptide (e.g., from about 5 to about 25 amino acids in length, preferably 
from about 10 to 20 or 12 to 18 amino acids in length, preferably 12, 15, or 18 amino acids in 
length), small organic molecule, polysaccharide, lipid, fatty acid, polynucleotide, 
oligonucleotide, etc., to be tested for the capacity to directly or indirectly modulate 
angiogenesis. The test compound can be in the form of a library of test compounds, such as a 
combinatorial or randomized library that provides a sufficient range of diversity. Test 
compounds are optionally linked to a fusion partner, e.g., targeting compounds, rescue 
compounds, dimerization compounds, stabilizing compounds, addressable compounds, and 
other functional moieties. Conventionally, new chemical entities with useful properties are 
generated by identifying a test compound (called a "lead compound") with some desirable 
property or activity, e.g., inhibiting activity, creating variants of the lead compound, and 
evaluating the property and activity of those variant compounds. Often, high throughput 
screening (HTS) methods are employed for such an analysis. 

A "small organic molecule" refers to an organic molecule, either naturally 
occurring or synthetic, that has a molecular weight of more than about 50 daltons and less 
than about 2500 daltons, preferably less than about 2000 daltons, preferably between about 
100 and about 1000 daltons, more preferably between about 200 and about 500 daltons. 

"RNAi molecule" or an "siRNA" refers to a nucleic acid that forms a double 
stranded RNA, which double stranded RNA has the ability to reduce or inhibit expression of 
a gene or target gene when the siRNA is expressed in the same cell as the gene or target gene. 
"siRNA" thus refers to the double stranded RNA formed by the complementary strands. The 
complementary portions of the siRNA that hybridize to form the double stranded molecule 
typically have substantial or complete identity. In one embodiment, an siRNA refers to a 
nucleic acid that has substantial or complete identity to a target gene and forms a double 
stranded siRNA. The sequence of the siRNA can correspond to the full length target gene, or 
a subsequence thereof. Typically, the siRNA is at least about 15-50 nucleotides in length 
(e.g., each complementary sequence of the double stranded siRNA is 15-50 nucleotides in 
length, and the double stranded siRNA is about 15-50 base pairs in length), preferably about 
20-30 nucleotides, preferably about 20-25 nucleotides in length, e.g., 
20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in length. 
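
The relationship between a target subsequence and the two complementary strands of an siRNA can be illustrated with the following minimal Python sketch. The function names are hypothetical, the 21-nucleotide default is merely one of the lengths listed above, and practical considerations such as overhangs or target-site selection are deliberately left out.

```python
from typing import Tuple

# Watson-Crick pairing for RNA bases.
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(seq: str) -> str:
    """Return the reverse complement of an RNA sequence written 5' to 3'."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(seq))


def sirna_strands(target: str, start: int, length: int = 21) -> Tuple[str, str]:
    """Return (sense, antisense) strands for a window of the target sequence.

    The window must be 15-50 nucleotides long, the range given in the text.
    DNA input is accepted and converted to RNA (T -> U).
    """
    if not 15 <= length <= 50:
        raise ValueError("siRNA length must be between 15 and 50 nucleotides")
    sense = target[start:start + length].upper().replace("T", "U")
    if len(sense) != length:
        raise ValueError("window runs past the end of the target sequence")
    return sense, reverse_complement_rna(sense)
```

The antisense strand returned here is fully complementary to the sense strand, corresponding to the "complete identity" case; strands with only substantial identity would need a separate comparison step.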

"Biological sample" includes sections of tissues such as biopsy and autopsy 
samples, and frozen sections taken for histologic purposes. Such samples include blood, 
sputum, tissue, cultured cells, e.g., primary cultures, explants, and transformed cells, stool, 
urine, etc. A biological sample is typically obtained from a eukaryotic organism, most 
preferably a mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent, 
e.g., guinea pig, rat, mouse; rabbit; or a bird; reptile; or fish. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
same (i.e., about 70% identity, preferably 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or higher identity over a specified region (e.g., SEQ ID NO: 1 or 2), 
when compared and aligned for maximum correspondence over a comparison window or 
designated region) as measured using the BLAST or BLAST 2.0 sequence comparison 
algorithms with default parameters described below, or by manual alignment and visual 
inspection (see, e.g., NCBI web site or the like). Such sequences are then said to be 
"substantially identical." This definition also refers to, or may be applied to, the complement 
of a test sequence. The definition also includes sequences that have deletions and/or 
additions, as well as those that have substitutions. As described below, the preferred 
algorithms can account for gaps and the like. Preferably, identity exists over a region that is 
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
50-100 amino acids or nucleotides in length. 
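
As a simple illustration of the percentage referred to above, the following Python sketch computes percent identity for two sequences that have already been aligned. The function name is hypothetical, gaps are assumed to be written as "-", and counting gap columns in the denominator is one common convention adopted here as an assumption; alignment programs may report the value differently.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must be the same length (already aligned); '-' marks a gap and
    never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)
```

Under this convention, an eight-column alignment with a single mismatch has 87.5% identity over the designated region.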

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
any one of the number of contiguous positions selected from the group consisting of from 20 
to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a 
sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l. 
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et al., eds. 
1995 supplement)). 
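
For orientation, a local alignment score of the Smith & Waterman type can be computed with a short dynamic-programming routine such as the Python sketch below. The function name and the scoring values (match 2, mismatch -1, linear gap penalty -2) are illustrative assumptions and are not taken from the cited references or from the packaged implementations named above.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -2) -> int:
    """Return the best local alignment score between sequences a and b."""
    rows, cols = len(a) + 1, len(b) + 1
    # score[i][j] holds the best score of a local alignment ending at a[i-1], b[j-1].
    score = [[0] * cols for _ in range(rows)]
    best = 0
    for i in range(1, rows):
        for j in range(1, cols):
            diagonal = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never drops below zero, which is what makes the alignment local.
            score[i][j] = max(0, diagonal, score[i - 1][j] + gap, score[i][j - 1] + gap)
            best = max(best, score[i][j])
    return best
```

Only the score is returned; recovering the aligned segments themselves would require a traceback through the matrix, which is omitted to keep the sketch short.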

Preferred examples of algorithms that are suitable for determining percent 
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which 
are described in Altschul et al., Nuc. Acids Res. 25:3389-3402 (1997) and Altschul et al., J. 
Mol. Biol. 215:403-410 (1990), respectively. BLAST and BLAST 2.0 are used, with the 
parameters described herein, to determine percent sequence identity for the nucleic acids and 
proteins of the invention. Software for performing BLAST analyses is publicly available 
through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). 
This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 
short words of length W in the query sequence, which either match or satisfy some 
positive-valued threshold score T when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score threshold (Altschul et al., supra). 
These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs 
containing them. The word hits are extended in both directions along each sequence for as 
far as the cumulative alignment score can be increased. Cumulative scores are calculated 
using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the 
word hits in each direction is halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1989)) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 
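
The seeding and extension steps described above can be sketched, in greatly simplified form, as follows. This Python example is not the BLAST implementation: it uses exact word matches only (no neighborhood words or threshold T), performs ungapped extension, and the function names and the default drop-off X=20 are assumptions made for illustration. The values W=11, M=5 and N=-4 follow the BLASTN defaults quoted in the text.

```python
from typing import Dict, List, Tuple


def find_seeds(query: str, subject: str, w: int = 11) -> List[Tuple[int, int]]:
    """Return (query_pos, subject_pos) for every exact word of length w shared by both."""
    index: Dict[str, List[int]] = {}
    for j in range(len(subject) - w + 1):
        index.setdefault(subject[j:j + w], []).append(j)
    return [(i, j)
            for i in range(len(query) - w + 1)
            for j in index.get(query[i:i + w], [])]


def extend_seed(query: str, subject: str, qi: int, sj: int, w: int,
                m: int = 5, n: int = -4, x: int = 20) -> Tuple[int, int, int]:
    """Ungapped extension of an exact seed in both directions.

    Returns (query_start, query_end_exclusive, score) of the resulting HSP.
    """
    seed_score = w * m

    def walk(q: int, s: int, step: int) -> Tuple[int, int]:
        running = best = seed_score
        best_len = length = 0
        while 0 <= q < len(query) and 0 <= s < len(subject):
            running += m if query[q] == subject[s] else n
            length += 1
            if running > best:
                best, best_len = running, length
            elif best - running >= x or running <= 0:
                break  # score fell off by X from its maximum, or dropped to zero
            q += step
            s += step
        return best - seed_score, best_len

    right_gain, right_len = walk(qi + w, sj + w, 1)
    left_gain, left_len = walk(qi - 1, sj - 1, -1)
    return qi - left_len, qi + w + right_len, seed_score + left_gain + right_gain
```

With short toy sequences a smaller word length (e.g., w=4) is needed for any seed to be found; extending each seed and keeping the highest-scoring result gives a rough analogue of an HSP.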

"Nucleic acid" refers to deoxyribonucleotides or ribonucleotides and polymers 
thereof in single- or double-stranded form, or complements thereof. The term encompasses 
nucleic acids containing known nucleotide analogs or modified backbone residues or 
linkages, which are synthetic, naturally occurring, and non-naturally occurring, which have 
similar binding properties as the reference nucleic acid, and which are metabolized in a 
manner similar to the reference nucleotides. Examples of such analogs include, without 
limitation, phosphorothioates, phosphoramidates, methyl phosphonates, chiral-methyl 
phosphonates, 2-O-methyl ribonucleotides, and peptide-nucleic acids (PNAs). Nucleic acids 
also include complementary nucleic acids. 

Unless otherwise indicated, a particular nucleic acid sequence also implicitly 
encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) 
and complementary sequences, as well as the sequence explicitly indicated. Specifically, 
degenerate codon substitutions may be achieved by generating sequences in which the third 
position of one or more selected (or all) codons is substituted with mixed-base and/or 
deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. 
Biol. Chem. 260:2605-2608 (1985); Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)). The 
term nucleic acid is used interchangeably with gene, cDNA, mRNA, oligonucleotide, and 
polynucleotide. 

A particular nucleic acid sequence also implicitly encompasses "splice 
variants." Similarly, a particular protein encoded by a nucleic acid implicitly encompasses 
any protein encoded by a splice variant of that nucleic acid. "Splice variants," as the name 
suggests, are products of alternative splicing of a gene. After transcription, an initial nucleic 
acid transcript may be spliced such that different (alternate) nucleic acid splice products 
encode different polypeptides. Mechanisms for the production of splice variants vary, but 
include alternate splicing of exons. Alternate polypeptides derived from the same nucleic 
acid by read-through transcription are also encompassed by this definition. Any products of a 
splicing reaction, including recombinant forms of the splice products, are included in this 
definition. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residues are artificial chemical mimetics of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers and 
non-naturally occurring amino acid polymers. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 
as well as amino acid analogs and amino acid mimetics that function in a manner similar to 
the naturally occurring amino acids. Naturally occurring amino acids are those encoded by 
the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, 
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, i.e., an α carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified 
R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical 
structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical 
compounds that have a structure that is different from the general chemical structure of an 
amino acid, but that function in a manner similar to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known three 
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 



"Conservatively modified variants" applies to both amino acid and nucleic 
acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 
number of functionally identical nucleic acids encode any given protein. For instance, the 
codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every 
position where an alanine is specified by a codon, the codon can be altered to any of the 
corresponding codons described without altering the encoded polypeptide. Such nucleic acid 
variations are "silent variations," which are one species of conservatively modified 
variations. Every nucleic acid sequence herein which encodes a polypeptide also describes 
every possible silent variation of the nucleic acid. One of skill will recognize that each codon 
in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, 
which is ordinarily the only codon for tryptophan) can be modified to yield a functionally 
identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a 
polypeptide is implicit in each described sequence with respect to the expression product, but 
not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention. 

The following eight groups each contain amino acids that are conservative 
substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic 
acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), 
Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan 
(W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, 
Proteins (1984)). 
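
The eight groups of conservative substitutions listed above lend themselves to a simple lookup, sketched below in Python. The names CONSERVATIVE_GROUPS and is_conservative_substitution are hypothetical, and the one-letter codes are those given in the list; note that methionine (M) appears in two groups, which the set-based lookup handles naturally.

```python
# One set per group, using the one-letter amino acid codes from the list above.
CONSERVATIVE_GROUPS = [
    set("AG"),    # 1) Alanine, Glycine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
    set("ST"),    # 7) Serine, Threonine
    set("CM"),    # 8) Cysteine, Methionine
]


def is_conservative_substitution(original: str, replacement: str) -> bool:
    """True if the two different residues share at least one of the eight groups."""
    a, b = original.upper(), replacement.upper()
    return a != b and any(a in group and b in group for group in CONSERVATIVE_GROUPS)
```

An unchanged residue is reported as False because it is not a substitution at all; whether a given variant as a whole qualifies as "conservatively modified" depends on the further criteria stated in the definition.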
Macromolecular structures such as polypeptide structures can be described in 
terms of various levels of organization. For a general discussion of this organization, see, 
e.g., Alberts et al., Molecular Biology of the Cell (3rd ed., 1994) and Cantor and Schimmel, 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980). 
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 
structures are commonly known as domains, e.g., transmembrane domains, pore domains, 
and cytoplasmic tail domains. Domains are portions of a polypeptide that form a compact 
unit of the polypeptide and are typically 15 to 350 amino acids long. Exemplary domains 
include domains with enzymatic activity, e.g., phosphatase domains, ligand binding domains, 
etc. Typical domains are made up of sections of lesser organization such as stretches of 
β-sheet and α-helices. "Tertiary structure" refers to the complete three dimensional structure of 
a polypeptide monomer. "Quaternary structure" refers to the three dimensional structure 
formed by the noncovalent association of independent tertiary units. Anisotropic terms are 
also known as energy terms. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
means. For example, useful labels include 32P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 
which can be made detectable, e.g., by incorporating a radiolabel into the peptide or used to 
detect antibodies specifically reactive with the peptide. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic 
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, for 
example, recombinant cells express genes that are not found within the native (non- 
recombinant) form of the cell or express native genes that are otherwise abnormally 
expressed, under expressed or not expressed at all. 

The term "heterologous" when used with reference to portions of a nucleic 
acid indicates that the nucleic acid comprises two or more subsequences that are not found in 
the same relationship to each other in nature. For instance, the nucleic acid is typically 
recombinantly produced, having two or more sequences from unrelated genes arranged to 
make a new functional nucleic acid, e.g., a promoter from one source and a coding region 
from another source. Similarly, a heterologous protein indicates that the protein comprises 
two or more subsequences that are not found in the same relationship to each other in nature 
(e.g., a fusion protein). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 
Tijssen, Techniques in Biochemistry and Molecular Biology: Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, stringent conditions are selected to be about 5-10°C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is 
the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which 50% 
of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at Tm, 50% of the probes are occupied at 
equilibrium). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. For selective or specific hybridization, a positive signal is at least 
two times background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as follows: 50% formamide, 5x SSC, and 1% SDS, 
incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 
0.1% SDS at 65°C. 
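
Two of the numerical rules stated above, the 5-10°C offset below Tm and the signal-to-background criterion, can be expressed as a minimal Python sketch. The function names are hypothetical, and the Tm is taken as a given input because the text does not prescribe a formula for calculating it.

```python
from typing import Tuple


def stringent_temperature_range(tm_celsius: float) -> Tuple[float, float]:
    """Return the (low, high) temperature window lying 10 to 5 degrees C below the Tm."""
    return tm_celsius - 10.0, tm_celsius - 5.0


def is_positive_signal(signal: float, background: float, fold: float = 2.0) -> bool:
    """True if the signal is at least `fold` times background (2x minimum, 10x preferred)."""
    if background <= 0:
        raise ValueError("background must be positive")
    return signal >= fold * background
```

For a probe with a Tm of 70°C, for instance, the sketch returns a window of 60-65°C; passing fold=10 applies the preferred, more demanding criterion.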

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical. 
This occurs, for example, when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 37°C, and a wash in 1X SSC at 45°C. A positive hybridization is at least twice 
background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous references, e.g., 
Current Protocols in Molecular Biology, ed. Ausubel, et al. 

For PCR, a temperature of about 36°C is typical for low stringency 
amplification, although annealing temperatures may vary between about 32°C and 48°C 
depending on primer length. For high stringency PCR amplification, a temperature of about 
62°C is typical, although high stringency annealing temperatures can range from about 50°C 
to about 65°C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for 
30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about 
72°C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification 
reactions are provided, e.g., in Innis et al. (1990) PCR Protocols, A Guide to Methods and 
Applications, Academic Press, Inc. N.Y. 
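
The cycle conditions above can be captured as a small configuration object, as in the following Python sketch. The class and function names are hypothetical; the annealing ranges and typical values come from the text, while the specific denaturation (94°C, 30 s), annealing time (30 s) and extension time (60 s) are arbitrary choices within the stated ranges, made only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class PcrCycle:
    denature_c: float
    denature_s: int
    anneal_c: float
    anneal_s: int
    extend_c: float
    extend_s: int


# Annealing temperature ranges and typical values stated in the text (degrees C).
ANNEAL_RANGE = {"low": (32.0, 48.0), "high": (50.0, 65.0)}
TYPICAL_ANNEAL = {"low": 36.0, "high": 62.0}


def make_cycle(stringency: str, anneal_c: Optional[float] = None) -> PcrCycle:
    """Build one cycle for 'low' or 'high' stringency, validating the annealing step."""
    if stringency not in ANNEAL_RANGE:
        raise ValueError("stringency must be 'low' or 'high'")
    low, high = ANNEAL_RANGE[stringency]
    temperature = TYPICAL_ANNEAL[stringency] if anneal_c is None else anneal_c
    if not low <= temperature <= high:
        raise ValueError("annealing temperature outside the stated range")
    # Denaturation and extension values are assumed picks within the stated ranges.
    return PcrCycle(denature_c=94.0, denature_s=30, anneal_c=temperature,
                    anneal_s=30, extend_c=72.0, extend_s=60)
```

In practice the annealing temperature would be chosen from primer length and specificity, as the text notes; the validation step merely keeps the chosen value inside the range stated for the selected stringency.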

"Antibody" refers to a polypeptide comprising a framework region from an 

immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody 
will be most critical in specificity and affinity of binding. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 110 or more amino acids primarily 
responsible for antigen recognition. The terms variable light chain (VL) and variable heavy 
chain (VH) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of 
well-characterized fragments produced by digestion with various peptidases. Thus, for example, 
pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, 
a dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 
may be reduced under mild conditions to break the disulfide linkage in the hinge region, 
thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is 
essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed. 
1993)). While various antibody fragments are defined in terms of the digestion of an intact 
antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 
herein, also includes antibody fragments either produced by the modification of whole 
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries (see, e.g., McCafferty et al., Nature 
348:552-554 (1990)). 



For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal
antibodies, many techniques known in the art can be used (see, e.g., Kohler & Milstein,
Nature 256:495-497 (1975); Kozbor et al., Immunology Today 4:72 (1983); Cole et al., pp.
77-96 in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985); Coligan,
Current Protocols in Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual
(1988); and Goding, Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). The
genes encoding the heavy and light chains of an antibody of interest can be cloned from a
cell, e.g., the genes encoding a monoclonal antibody can be cloned from a hybridoma and
used to produce a recombinant monoclonal antibody. Gene libraries encoding heavy and
light chains of monoclonal antibodies can also be made from hybridoma or plasma cells.
Random combinations of the heavy and light chain gene products generate a large pool of
antibodies with different antigenic specificity (see, e.g., Kuby, Immunology (3rd ed. 1997)).
Techniques for the production of single chain antibodies or recombinant antibodies (U.S.
Patent 4,946,778, U.S. Patent No. 4,816,567) can be adapted to produce antibodies to
polypeptides of this invention. Also, transgenic mice, or other organisms such as other
mammals, may be used to express humanized or human antibodies (see, e.g., U.S. Patent
Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, Marks et al.,
Bio/Technology 10:779-783 (1992); Lonberg et al., Nature 368:856-859 (1994); Morrison,
Nature 368:812-13 (1994); Fishwild et al., Nature Biotechnology 14:845-51 (1996);
Neuberger, Nature Biotechnology 14:826 (1996); and Lonberg & Huszar, Intern. Rev.
Immunol. 13:65-93 (1995)). Alternatively, phage display technology can be used to identify
antibodies and heteromeric Fab fragments that specifically bind to selected antigens (see, e.g.,
McCafferty et al., Nature 348:552-554 (1990); Marks et al., Biotechnology 10:779-783
(1992)). Antibodies can also be made bispecific, i.e., able to recognize two different antigens
(see, e.g., WO 93/08829, Traunecker et al., EMBO J. 10:3655-3659 (1991); and Suresh et al.,
Methods in Enzymology 121:210 (1986)). Antibodies can also be heteroconjugates, e.g., two
covalently joined antibodies, or immunotoxins (see, e.g., U.S. Patent No. 4,676,980, WO
91/00360; WO 92/200373; and EP 03089).

Methods for humanizing or primatizing non-human antibodies are well
known in the art. Generally, a humanized antibody has one or more amino acid residues
introduced into it from a source which is non-human. These non-human amino acid residues
are often referred to as import residues, which are typically taken from an import variable
domain. Humanization can be essentially performed following the method of Winter and
co-workers (see, e.g., Jones et al., Nature 321:522-525 (1986); Riechmann et al., Nature
332:323-327 (1988); Verhoeyen et al., Science 239:1534-1536 (1988) and Presta, Curr. Op.
Struct. Biol. 2:593-596 (1992)), by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact
human variable domain has been substituted by the corresponding sequence from a
non-human species. In practice, humanized antibodies are typically human antibodies in which
some CDR residues and possibly some FR residues are substituted by residues from
analogous sites in rodent antibodies.



A "chimeric antibody" is an antibody molecule in which (a) the constant
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site
(variable region) is linked to a constant region of a different or altered class, effector function
and/or species, or an entirely different molecule which confers new properties to the chimeric
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a
different or altered antigen specificity.

In one embodiment, the antibody is conjugated to an "effector" moiety. The
effector moiety can be any number of molecules, including labeling moieties such as
radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the
antibody modulates the activity of the protein.

The phrase "specifically (or selectively) binds" to an antibody or "specifically
(or selectively) immunoreactive with," when referring to a protein or peptide, refers to a
binding reaction that is determinative of the presence of the protein, often in a heterogeneous
population of proteins and other biologics. Thus, under designated immunoassay conditions,
the specified antibodies bind to a particular protein at least two times the background and
more typically more than 10 to 100 times background. Specific binding to an antibody under
such conditions requires an antibody that is selected for its specificity for a particular protein.
For example, polyclonal antibodies raised to angiogenesis regulatory proteins, polymorphic
variants, alleles, orthologs, and conservatively modified variants, or splice variants, or
portions thereof, can be selected to obtain only those polyclonal antibodies that are
specifically immunoreactive with angiogenesis regulatory proteins and not with other
proteins. This selection may be achieved by subtracting out antibodies that cross-react with
other molecules. A variety of immunoassay formats may be used to select antibodies
specifically immunoreactive with a particular protein. For example, solid-phase ELISA
immunoassays are routinely used to select antibodies specifically immunoreactive with a
protein (see, e.g., Harlow & Lane, Antibodies, A Laboratory Manual (1988) for a description
of immunoassay formats and conditions that can be used to determine specific
immunoreactivity).
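
The fold-over-background criterion stated above can be expressed as a short calculation. The following is only an illustrative sketch: the function names and the returned labels are hypothetical, and the thresholds (2-fold minimum, more than 10-fold typical) are taken from the definition above rather than from any particular immunoassay protocol.

```python
def fold_over_background(signal: float, background: float) -> float:
    """Return the ratio of assay signal to background signal."""
    if background <= 0:
        raise ValueError("background must be positive")
    return signal / background


def classify_binding(signal: float, background: float,
                     minimum_fold: float = 2.0,
                     typical_fold: float = 10.0) -> str:
    """Label a reading using the fold-over-background thresholds.

    minimum_fold and typical_fold mirror the text ("at least two times
    the background", "more than 10 to 100 times background"); the label
    strings themselves are illustrative.
    """
    ratio = fold_over_background(signal, background)
    if ratio < minimum_fold:
        return "not specific"
    if ratio > typical_fold:
        return "specific (typical range)"
    return "specific (minimum threshold met)"
```

For instance, a reading of 3 units over a background of 1 unit would meet the minimum threshold, whereas 50 units over the same background would fall in the more typical range.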

Assays for proteins that modulate angiogenesis 



High throughput functional genomics assays can be used to identify
modulators of angiogenesis. Such assays can monitor changes in cell surface marker
expression, αvβ3 integrin production, proliferation, and differentiation using either cell lines
or primary cells, e.g., endothelial cells such as HUVEC cells. Typically, early passage or
primary endothelial cells are contacted with a cDNA or a random peptide library (encoded by
nucleic acids). Often the libraries are retroviral libraries. The cDNA library can comprise
sense, antisense, full length, and truncated cDNAs. The cDNAs are often from endothelial
cells, e.g., HUVEC cells, either differentiated on matrigel or 3-D collagen, or undifferentiated.
The peptide library is encoded by nucleic acids. The cDNAs are optionally fused to markers
such as GFP. The effect of the cDNA or peptide library on the endothelial cells is then
monitored, using an assay such as cell surface marker expression (e.g., αvβ3 integrin or
VEGF2-R2) or a phenotypic assay for angiogenesis such as migration towards an ECM
(extracellular matrix) component (see, e.g., Klemke et al., J. Cell Biol. 4:961-972 (1998)) or
endothelial cell tube formation assays, as well as other bioassays such as the chick CAM
assay, the mouse corneal assay, haptotaxis assays, and assays measuring the effect of
administering potential modulators on implanted tumors. The effect of the cDNA or peptide
can be validated and distinguished from somatic mutations, using, e.g., regulatable expression
of the nucleic acid such as expression from a tetracycline promoter. cDNAs and nucleic
acids encoding peptides can be rescued using techniques known to those of skill in the art,
e.g., using sequence tags.

Proteins interacting with the peptide or with the protein encoded by the cDNA
(e.g., angiogenesis regulatory protein) can be isolated using a yeast two-hybrid system,
mammalian two hybrid system, immunoprecipitation and proteomics, phage display screen,
etc. Targets so identified can be further used as bait in these assays to identify additional
members of the angiogenesis pathway, which members are also targets for drug development
(see, e.g., Fields et al., Nature 340:245 (1989); Vasavada et al., Proc. Nat'l Acad. Sci. USA
88:10686 (1991); Fearon et al., Proc. Nat'l Acad. Sci. USA 89:7958 (1992); Dang et al., Mol.
Cell. Biol. 11:954 (1991); Chien et al., Proc. Nat'l Acad. Sci. USA 9578 (1991); and U.S.
Patent Nos. 5,283,173, 5,667,973, 5,468,614, 5,525,490, and 5,637,463).

Suitable endothelial cell lines include human umbilical vein cells (see, e.g.,
Jaffe et al., J. Clin. Invest. 52:2745-2754 (1973)); human adult dermal capillary-derived cells
(see, e.g., Davison et al., In Vitro 19:937-945 (1983)); human adipose capillary derived cells
(see, e.g., Kern et al., J. Clin. Invest. 71:1822-1829 (1983)); bovine aorta (see, e.g., Booyse et
al., Thromb. Diathes. Haemorrh. 34:825-839 (1975)); and rat brain capillary derived cells
(see, e.g., Bowman et al., In Vitro 17:353-362 (1981)). For culture of endothelial cell lines,
explants, and primary cells, see Freshney et al., Culture of Animal Cells (3rd ed. 1994).
Suitable angiogenesis cell surface markers include αvβ3 integrin (see, e.g., Eliceiri &
Cheresh, Cancer J. Sci. Am. 6 Supp. 3:S245-249 (2000), Maeshima et al., J. Biol. Chem.
(June 8, 2001)).

Cell surface markers such as αvβ3 can be assayed using fluorescently labeled
antibodies and FACS. Cell proliferation can be measured using 3H-thymidine or dye
inclusion. Angiogenesis phenotype is measured by loss of phenotype observation.

cDNA libraries are made from any suitable source, preferably from
endothelial cells. Libraries encoding random peptides are made according to techniques well
known to those of skill in the art (see, e.g., U.S. Patent No. 6,153,380, 6,114,111, and
6,180,343). Any suitable vector can be used for the cDNA and peptide libraries, including,
e.g., retroviral vectors.

In a preferred embodiment, target proteins that modulate angiogenesis or
nucleic acids that encode the target proteins are identified using a high throughput cell based
assay (using a microtiter plate format) and FACS screening for αvβ3 cell surface expression.
cDNA libraries are made which include, e.g., sense, antisense, full length, and truncated
cDNAs. The cDNAs are cloned into a retroviral vector. Endothelial cells are infected with
the library, cultured for a short effector phase and then the cells with reduced αvβ3 surface
levels are enriched by antibody staining and magnetic cell sorting. The enriched cell
population is then sorted into microtiter plates using fluorescent antibodies and FACS.
Resultant cell colonies are analyzed by immunofluorescence for reduced αvβ3 surface levels.
Selected colonies are infected with wild type MMLV virus to rescue the proviral vector. The
infectious supernatant is used to infect endothelial cells, which are subsequently analyzed for
αvβ3 levels by FACS. The cDNA is isolated and sequenced to determine if it represents a
wild type or mutated cDNA, e.g., whether the cDNA represents a negative transdominant
mutant. Optionally, a marker such as GFP can be used to select for retrovirally infected cells.
Using this system, cDNA molecules encoding angiogenesis regulatory proteins were identified
as targets for anti-angiogenic drug therapy. Some of the cDNAs were fused in frame to a
GFP marker. Some cDNA molecules were expressed as inhibitory antisense molecules.
Finally, some cDNAs were expressed in the antisense orientation and expression of the
translated protein or peptide resulted in an angiogenesis phenotype.

Isolation of nucleic acids encoding angiogenesis regulatory proteins 

This invention relies on routine techniques in the field of recombinant
genetics. Basic texts disclosing the general methods of use in this invention include
Sambrook et al., Molecular Cloning, A Laboratory Manual (2nd ed. 1989); Kriegler, Gene
Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in Molecular
Biology (Ausubel et al., eds., 1994).



Nucleic acids encoding angiogenesis regulatory proteins and their
polymorphic variants, orthologs, and alleles that are substantially identical to the amino acid
sequence of SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44, SEQ ID NO:58, SEQ ID NO:64,
SEQ ID NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID NO:82, SEQ ID NO:87, SEQ ID
NO:90, SEQ ID NO:121, SEQ ID NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ ID
NO:164, SEQ ID NO:170, SEQ ID NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID
NO:184, SEQ ID NO:203, SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID
NO:319, SEQ ID NO:325, SEQ ID NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID
NO:366, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID
NO:420, SEQ ID NO:429, SEQ ID NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID
NO:457, SEQ ID NO:463, SEQ ID NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID
NO:497, SEQ ID NO:499, and SEQ ID NO:520 can be isolated using nucleic acid probes and
oligonucleotides under stringent hybridization conditions, by screening libraries.
Alternatively, expression libraries can be used to clone angiogenesis regulatory proteins,
polymorphic variants, orthologs, and alleles by detecting expressed homologs
immunologically with antisera or purified antibodies made against a human angiogenesis
regulatory protein or portions thereof.

To make a cDNA library, one should choose a source that is rich in
angiogenesis regulatory protein RNA, e.g., endothelial cells. The mRNA is then made into
cDNA using reverse transcriptase, ligated into a recombinant vector, and transfected into a
recombinant host for propagation, screening and cloning. Methods for making and screening
cDNA libraries are well known (see, e.g., Gubler & Hoffman, Gene 25:263-269 (1983);
Sambrook et al., supra; Ausubel et al., supra).

For a genomic library, the DNA is extracted from the tissue and either
mechanically sheared or enzymatically digested to yield fragments of about 12-20 kb. The
fragments are then separated by gradient centrifugation from undesired sizes and are
constructed in bacteriophage lambda vectors. These vectors and phage are packaged in vitro.
Recombinant phage are analyzed by plaque hybridization as described in Benton & Davis,
Science 196:180-182 (1977). Colony hybridization is carried out as generally described in
Grunstein et al., Proc. Natl. Acad. Sci. USA 72:3961-3965 (1975).
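
When planning a genomic library with inserts in the 12-20 kb range, it can help to estimate how many independent clones are needed to represent a given sequence. The sketch below uses the commonly cited Clarke and Carbon estimate, N = ln(1 - P) / ln(1 - f), where f is the insert size divided by the genome size. This formula and the genome size used in the example are assumptions introduced for illustration; neither is taken from the present specification.

```python
import math


def clones_needed(insert_bp: float, genome_bp: float,
                  probability: float = 0.99) -> int:
    """Estimate the number of independent clones needed so that any given
    sequence is present in the library with the stated probability.

    Uses N = ln(1 - P) / ln(1 - f), with f = insert_bp / genome_bp.
    """
    if not 0 < probability < 1:
        raise ValueError("probability must be between 0 and 1")
    if not 0 < insert_bp < genome_bp:
        raise ValueError("insert must be smaller than the genome")
    fraction = insert_bp / genome_bp
    # log1p(-x) computes ln(1 - x) accurately for very small x.
    return math.ceil(math.log1p(-probability) / math.log1p(-fraction))


# Assumed genome size of 3e9 bp, chosen only for illustration.
ASSUMED_GENOME_BP = 3_000_000_000
small_insert_library = clones_needed(12_000, ASSUMED_GENOME_BP)
large_insert_library = clones_needed(20_000, ASSUMED_GENOME_BP)
```

Under these assumptions, the smaller 12 kb inserts require more clones than 20 kb inserts for the same coverage probability, which is one reason larger fragments may be preferred.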

An alternative method of isolating angiogenesis regulatory protein nucleic
acid and its orthologs, alleles, mutants, polymorphic variants, and conservatively modified
variants combines the use of synthetic oligonucleotide primers and amplification of an RNA
or DNA template (see U.S. Patents 4,683,195 and 4,683,202; PCR Protocols: A Guide to
Methods and Applications (Innis et al., eds, 1990)). Methods such as polymerase chain
reaction (PCR) and ligase chain reaction (LCR) can be used to amplify nucleic acid
sequences encoding human angiogenesis regulatory proteins directly from mRNA, from
cDNA, from genomic libraries or cDNA libraries. Degenerate oligonucleotides can be
designed to amplify angiogenesis regulatory protein homologs using the sequences provided
herein. Restriction endonuclease sites can be incorporated into the primers. Polymerase
chain reaction or other in vitro amplification methods may also be useful, for example, to
clone nucleic acid sequences that code for proteins to be expressed, to make nucleic acids to
use as probes for detecting the presence of angiogenesis regulatory protein encoding mRNA
in physiological samples, for nucleic acid sequencing, or for other purposes. Genes amplified
by the PCR reaction can be purified from agarose gels and cloned into an appropriate vector.
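
As an illustration of what designing a degenerate oligonucleotide involves, the sketch below counts and enumerates the individual sequences represented by a primer written with the standard IUPAC nucleotide ambiguity codes. The function names and the example primers are hypothetical and are not sequences from this specification.

```python
from itertools import product
from math import prod

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences represented by a degenerate primer."""
    return prod(len(IUPAC[base]) for base in primer.upper())


def expand(primer: str) -> list[str]:
    """List every non-degenerate sequence encoded by the primer."""
    choices = (IUPAC[base] for base in primer.upper())
    return ["".join(combo) for combo in product(*choices)]
```

A primer such as the hypothetical `GARTTYN` represents 2 x 2 x 4 = 16 sequences; keeping this number low is generally desirable because each individual sequence is present at a correspondingly lower concentration in the primer pool.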

Gene expression of angiogenesis regulatory proteins can also be analyzed by
techniques known in the art, e.g., reverse transcription and amplification of mRNA, isolation
of total RNA or poly A+ RNA, northern blotting, dot blotting, in situ hybridization, RNase
protection, high density polynucleotide array technology, and the like.

Nucleic acids encoding angiogenesis regulatory proteins can be used with high
density oligonucleotide array technology (e.g., GeneChip™) to identify angiogenesis
regulatory proteins, orthologs, alleles, conservatively modified variants, and polymorphic
variants in this invention. In the case where the homologs being identified are linked to a
condition such as neovascularization or angiogenic insufficiency, they can be used with
GeneChip™ as a diagnostic tool in detecting the disease in a biological sample, see, e.g.,
Gunthard et al., AIDS Res. Hum. Retroviruses 14:869-876 (1998); Kozal et al., Nat. Med.
2:753-759 (1996); Matson et al., Anal. Biochem. 224:110-106 (1995); Lockhart et al., Nat.
Biotechnol. 14:1675-1680 (1996); Gingeras et al., Genome Res. 8:435-448 (1998); Hacia et
al., Nucleic Acids Res. 26:3865-3866 (1998).



intermediate vectors before transformation into prokaryotic or eukaryotic cells for replication 
and/or expression. These intermediate vectors are typically prokaryote vectors, e.g., 
plasmids, or shuttle vectors. 

Expression in prokaryotes and eukaryotes

To obtain high level expression of a cloned gene, such as those cDNAs
encoding an angiogenesis regulatory protein, one typically subclones a gene encoding an
angiogenesis regulatory protein into an expression vector that contains a strong promoter to
direct transcription, a transcription/translation terminator, and if for a nucleic acid encoding a
protein, a ribosome binding site for translational initiation. Suitable bacterial promoters are
well known in the art and described, e.g., in Sambrook et al. and Ausubel et al., supra.
Bacterial expression systems for expressing angiogenesis regulatory proteins are available in,
e.g., E. coli, Bacillus sp., and Salmonella (Palva et al., Gene 22:229-235 (1983); Mosbach et
al., Nature 302:543-545 (1983)). Kits for such expression systems are commercially
available. Eukaryotic expression systems for mammalian cells, yeast, and insect cells are
well known in the art and are also commercially available.

Selection of the promoter used to direct expression of a heterologous nucleic
acid depends on the particular application. The promoter is preferably positioned about the
same distance from the heterologous transcription start site as it is from the transcription start
site in its natural setting. As is known in the art, however, some variation in this distance can
be accommodated without loss of promoter function.

In addition to the promoter, the expression vector typically contains a
transcription unit or expression cassette that contains all the additional elements required for
the expression of the angiogenesis regulatory protein encoding nucleic acid in host cells. A
typical expression cassette thus contains a promoter operably linked to the nucleic acid
sequence encoding an angiogenesis regulatory protein and signals required for efficient
polyadenylation of the transcript, ribosome binding sites, and translation termination.
Additional elements of the cassette may include enhancers and, if genomic DNA is used as
the structural gene, introns with functional splice donor and acceptor sites.

In addition to a promoter sequence, the expression cassette should also contain
a transcription termination region downstream of the structural gene to provide for efficient
termination. The termination region may be obtained from the same gene as the promoter
sequence or may be obtained from different genes.

The particular expression vector used to transport the genetic information into 
the cell is not particularly critical. Any of the conventional vectors used for expression in 
eukaryotic or prokaryotic cells may be used. Standard bacterial expression vectors include 
plasmids such as pBR322 based plasmids, pSKF, pET23D, and fusion expression systems 
such as MBP, GST, and LacZ. Epitope tags can also be added to recombinant proteins to 
provide convenient methods of isolation, e.g., c-myc. 

Expression vectors containing regulatory elements from eukaryotic viruses are
typically used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus vectors,
retroviral vectors, and vectors derived from Epstein-Barr virus. Other exemplary eukaryotic
vectors include pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, baculovirus pDSVE, and
any other vector allowing expression of proteins under the direction of the CMV promoter,
SV40 early promoter, SV40 late promoter, metallothionein promoter, murine mammary
tumor virus promoter, Rous sarcoma virus promoter, polyhedrin promoter, or other promoters
shown effective for expression in eukaryotic cells.

Expression of proteins from eukaryotic vectors can also be regulated using
inducible promoters. With inducible promoters, expression levels are tied to the 
concentration of inducing agents, such as tetracycline or ecdysone, by the incorporation of 
response elements for these agents into the promoter. Generally, high level expression is 
obtained from inducible promoters only in the presence of the inducing agent; basal 
expression levels are minimal. Inducible expression vectors are often chosen if expression of 
the protein of interest is detrimental to eukaryotic cells. 

In one embodiment, the vectors of the invention have a regulatable promoter,
e.g., tet-regulated systems and the RU-486 system (see, e.g., Gossen & Bujard, Proc. Nat'l
Acad. Sci. USA 89:5547 (1992); Oligino et al., Gene Ther. 5:491-496 (1998); Wang et al.,
Gene Ther. 4:432-441 (1997); Neering et al., 88:1147-1155 (1996); and Rendahl et al.,
Nat. Biotechnol. 16:757-761 (1998)). These impart small molecule control on the expression
of the candidate target nucleic acids. This beneficial feature can be used to determine that a
desired phenotype is caused by a transfected cDNA rather than a somatic mutation.

Some expression systems have markers that provide gene amplification such
as thymidine kinase and dihydrofolate reductase. Alternatively, high yield expression
systems not involving gene amplification are also suitable, such as using a baculovirus vector
in insect cells, with an angiogenesis regulatory protein encoding sequence under the direction
of the polyhedrin promoter or other strong baculovirus promoters.

The elements that are typically included in expression vectors also include a
replicon that functions in E. coli, a gene encoding antibiotic resistance to permit selection of
bacteria that harbor recombinant plasmids, and unique restriction sites in nonessential regions
of the plasmid to allow insertion of eukaryotic sequences. The particular antibiotic resistance
gene chosen is not critical; any of the many resistance genes known in the art are suitable.
The prokaryotic sequences are preferably chosen such that they do not interfere with the
replication of the DNA in eukaryotic cells, if necessary.

Standard transfection methods are used to produce bacterial, mammalian,
yeast or insect cell lines that express large quantities of angiogenesis regulatory proteins,
which are then purified using standard techniques (see, e.g., Colley et al., J. Biol. Chem.
264:17619-17622 (1989); Guide to Protein Purification, in Methods in Enzymology, vol. 182
(Deutscher, ed., 1990)). Transformation of eukaryotic and prokaryotic cells is performed
according to standard techniques (see, e.g., Morrison, J. Bact. 132:349-351 (1977);
Clark-Curtiss & Curtiss, Methods in Enzymology 101:347-362 (Wu et al., eds, 1983)).

Any of the well-known procedures for introducing foreign nucleotide
sequences into host cells may be used. These include the use of calcium phosphate
transfection, polybrene, protoplast fusion, electroporation, biolistics, liposomes,
microinjection, plasmid vectors, viral vectors and any of the other well known methods for
introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material
into a host cell (see, e.g., Sambrook et al., supra). It is only necessary that the particular
genetic engineering procedure used be capable of successfully introducing at least one gene
into the host cell capable of expressing angiogenesis regulatory protein.

After the expression vector is introduced into the cells, the transfected cells are 
cultured under conditions favoring expression of angiogenesis regulatory protein, which is 
recovered from the culture using standard techniques identified below. 

Purification of angiogenesis regulatory proteins

Either naturally occurring or recombinant angiogenesis regulatory proteins can
be purified for use in functional assays. Naturally occurring angiogenesis regulatory protein
can be purified, e.g., from human tissue. Recombinant angiogenesis regulatory protein can
be purified from any suitable expression system.

The angiogenesis regulatory proteins may be purified to substantial purity by
standard techniques, including selective precipitation with such substances as ammonium
sulfate; column chromatography; immunopurification methods; and others (see, e.g., Scopes,
Protein Purification: Principles and Practice (1982); U.S. Patent No. 4,673,641; Ausubel et
al., supra; and Sambrook et al., supra).

A number of procedures can be employed when recombinant angiogenesis
regulatory proteins are being purified. For example, proteins having established molecular
adhesion properties can be reversibly fused to the angiogenesis regulatory proteins. With the
appropriate ligand, the angiogenesis regulatory proteins can be selectively adsorbed to a
purification column and then freed from the column in a relatively pure form. The fused
protein is then removed by enzymatic activity. Finally, the angiogenesis regulatory proteins
could be purified using immunoaffinity columns.



A. Purification of angiogenesis regulatory proteins from recombinant bacteria 

Recombinant proteins are expressed by transformed bacteria in large amounts,
typically after promoter induction; but expression can be constitutive. Promoter induction
with IPTG is one example of an inducible promoter system. Bacteria are grown according to
standard procedures in the art. Fresh or frozen bacterial cells are used for isolation of protein.

Proteins expressed in bacteria may form insoluble aggregates ("inclusion 
bodies"). Several protocols are suitable for purification of inclusion bodies containing 
angiogenesis regulatory proteins. For example, purification of inclusion bodies typically 
involves the extraction, separation and/or purification of inclusion bodies by disruption of 
bacterial cells, e.g., by incubation in a buffer of 50 mM Tris-HCl pH 7.5, 50 mM NaCl, 5
mM MgCl2, 1 mM DTT, 0.1 mM ATP, and 1 mM PMSF. The cell suspension can be lysed
using 2-3 passages through a French Press, homogenized using a Polytron (Brinkman 
Instruments) or sonicated on ice. Alternate methods of lysing bacteria are apparent to those 
of skill in the art (see, e.g., Sambrook et al, supra; Ausubel et al, supra). 
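
The lysis buffer composition given above can be translated into pipetting volumes with the usual dilution relation C1 x V1 = C2 x V2. The sketch below is a minimal example of that arithmetic. The final concentrations are those listed in the text; the stock concentrations are assumptions chosen for illustration and should be replaced with the stocks actually on hand.

```python
# Final concentrations (mM) as listed in the text for the lysis buffer.
FINAL_MM = {
    "Tris-HCl pH 7.5": 50.0,
    "NaCl": 50.0,
    "MgCl2": 5.0,
    "DTT": 1.0,
    "ATP": 0.1,
    "PMSF": 1.0,
}

# Stock concentrations (mM): ASSUMED values, not taken from the text.
ASSUMED_STOCK_MM = {
    "Tris-HCl pH 7.5": 1000.0,
    "NaCl": 5000.0,
    "MgCl2": 1000.0,
    "DTT": 1000.0,
    "ATP": 100.0,
    "PMSF": 100.0,
}


def stock_volumes_ml(total_ml, final_mm=FINAL_MM, stock_mm=ASSUMED_STOCK_MM):
    """Return the volume (mL) of each stock, plus water, for total_ml of buffer.

    Applies C1 * V1 = C2 * V2 to each component independently.
    """
    volumes = {
        name: final_mm[name] * total_ml / stock_mm[name] for name in final_mm
    }
    water = total_ml - sum(volumes.values())
    if water < 0:
        raise ValueError("stocks are too dilute for the requested volume")
    volumes["water"] = water
    return volumes
```

With the assumed stocks, 100 mL of buffer would take 5 mL of the Tris stock and 1 mL of the NaCl stock, with water making up the remainder.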

If necessary, the inclusion bodies are solubilized, and the lysed cell suspension 
is typically centrifuged to remove unwanted insoluble matter. Proteins that formed the 
inclusion bodies may be renatured by dilution or dialysis with a compatible buffer. Suitable 
solvents include, but are not limited to, urea (from about 4 M to about 8 M), formamide (at
least about 80%, volume/volume basis), and guanidine hydrochloride (from about 4 M to
about 8 M). Some solvents which are capable of solubilizing aggregate-forming proteins, for
example SDS (sodium dodecyl sulfate) and 70% formic acid, are inappropriate for use in this
procedure due to the possibility of irreversible denaturation of the proteins, accompanied by a
lack of immunogenicity and/or activity. Although guanidine hydrochloride and similar 
agents are denaturants, this denaturation is not irreversible and renaturation may occur upon 
removal (by dialysis, for example) or dilution of the denaturant, allowing re-formation of 
immunologically and/or biologically active protein. Other suitable buffers are known to 
those skilled in the art. Human angiogenesis regulatory proteins are separated from other 
bacterial proteins by standard separation techniques, e.g., with Ni-NTA agarose resin. 

Alternatively, it is possible to purify angiogenesis regulatory proteins from
the bacterial periplasm. After lysis of the bacteria, when the angiogenesis regulatory proteins
are exported into the periplasm of the bacteria, the periplasmic fraction of the bacteria can be
isolated by cold osmotic shock in addition to other methods known to those of skill in the art.
To isolate recombinant proteins from the periplasm, the bacterial cells are centrifuged to form a
pellet. The pellet is resuspended in a buffer containing 20% sucrose. To lyse the cells, the
bacteria are centrifuged and the pellet is resuspended in ice-cold 5 mM MgSO4 and kept in an
ice bath for approximately 10 minutes. The cell suspension is centrifuged and the
supernatant decanted and saved. The recombinant proteins present in the supernatant can be
separated from the host proteins by standard separation techniques well known to those of
skill in the art.



B. Standard protein separation techniques for purifying angiogenesis regulatory
proteins

Solubility fractionation 

Often as an initial step, particularly if the protein mixture is complex, an initial 
salt fractionation can separate many of the unwanted host cell proteins (or proteins derived 
from the cell culture media) from the recombinant protein of interest. The preferred salt is 
ammonium sulfate. Ammonium sulfate precipitates proteins by effectively reducing the 
amount of water in the protein mixture. Proteins then precipitate on the basis of their 
solubility. The more hydrophobic a protein is, the more likely it is to precipitate at lower 
ammonium sulfate concentrations. A typical protocol includes adding saturated ammonium 
sulfate to a protein solution so that the resultant ammonium sulfate concentration is between 
20-30%. This concentration will precipitate the most hydrophobic of proteins. The 
precipitate is then discarded (unless the protein of interest is hydrophobic) and ammonium 
sulfate is added to the supernatant to a concentration known to precipitate the protein of 
interest. The precipitate is then solubilized in buffer and the excess salt removed if
necessary, either through dialysis or diafiltration. Other methods that rely on solubility of
proteins, such as cold ethanol precipitation, are well known to those of skill in the art and can 
be used to fractionate complex protein mixtures. 
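
The ammonium sulfate steps described above involve a small amount of arithmetic when the salt is added as a saturated solution. Assuming that volumes are additive and that the added solution is 100% saturated, the volume to add to V0 at saturation S1 to reach saturation S2 is V0 x (S2 - S1) / (1 - S2). The sketch below applies this relation; the function name is hypothetical and the 50% second cut is an assumed placeholder, since the text leaves the second concentration to be determined for the protein of interest.

```python
def saturated_volume_to_add(volume_ml: float,
                            current_fraction: float,
                            target_fraction: float) -> float:
    """Volume (mL) of saturated ammonium sulfate solution to add.

    Fractions are expressed from 0 to 1 (e.g. 0.25 for 25% saturation).
    Assumes additive volumes and a fully saturated stock solution.
    """
    if not 0 <= current_fraction < target_fraction < 1:
        raise ValueError("require 0 <= current < target < 1")
    return volume_ml * (target_fraction - current_fraction) / (1 - target_fraction)


# First cut: bring 100 mL of protein solution to 25% saturation
# (within the 20-30% range given in the text).
first_addition = saturated_volume_to_add(100.0, 0.0, 0.25)

# Second cut: bring the supernatant to an ASSUMED 50% saturation.
# Treating the full volume as supernatant is a simplification.
second_addition = saturated_volume_to_add(100.0 + first_addition, 0.25, 0.50)
```

Because the required volume grows rapidly as the target approaches full saturation, high cuts are usually made by adding solid salt instead; that case needs tabulated values and is not covered by this sketch.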

Size differential filtration

The molecular weight of the angiogenesis regulatory proteins can be used to
isolate them from proteins of greater and lesser size using ultrafiltration through membranes of
different pore size (for example, Amicon or Millipore membranes). As a first step, the
protein mixture is ultrafiltered through a membrane with a pore size that has a lower
molecular weight cut-off than the molecular weight of the protein of interest. The retentate of
the ultrafiltration is then ultrafiltered against a membrane with a molecular weight cut-off greater
than the molecular weight of the protein of interest. The recombinant protein will pass
through the membrane into the filtrate. The filtrate can then be chromatographed as
described below.
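
The two-step membrane choice described above can be sketched as a simple selection from a set of available molecular weight cut-offs. The function name and the example cut-off values are hypothetical. Nominal cut-offs are not sharp in practice, so a margin between the protein's molecular weight and the cut-off would normally be allowed for; this sketch ignores that margin.

```python
def two_step_cutoffs(protein_kda: float, available_kda: list[float]):
    """Pick membranes for the two ultrafiltration steps.

    Step 1 uses the largest cut-off below the protein's molecular weight,
    so the protein stays in the retentate while smaller species pass.
    Step 2 uses the smallest cut-off above it, so the protein passes into
    the filtrate while larger species are retained.
    """
    below = [c for c in available_kda if c < protein_kda]
    above = [c for c in available_kda if c > protein_kda]
    if not below or not above:
        raise ValueError("no suitable pair of membranes available")
    return max(below), min(above)


# Hypothetical 45 kDa protein and an assumed set of membranes.
retain_step, pass_step = two_step_cutoffs(45.0, [10.0, 30.0, 50.0, 100.0])
```

In this example the 30 kDa membrane would be used first and the 50 kDa membrane second.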


Column chromatography 

The angiogenesis regulatory proteins can also be separated from other proteins
on the basis of their size, net surface charge, hydrophobicity, and affinity for ligands. In
addition, antibodies raised against proteins can be conjugated to column matrices and the
proteins immunopurified. All of these methods are well known in the art. It will be apparent
to one of skill that chromatographic techniques can be performed at any scale and using
equipment from many different manufacturers (e.g., Pharmacia Biotech).

Assays for modulators of angiogenesis regulatory proteins and angiogenesis 

A. Assays

Modulation of an angiogenesis regulatory protein, and corresponding
modulation of angiogenesis, can be assessed using a variety of in vitro and in vivo assays,
including high throughput ligand binding and cell based assays, as described herein. Such
assays can be used to test for inhibitors and activators of angiogenesis regulatory proteins,
and, consequently, inhibitors and activators of angiogenesis. Such modulators of
angiogenesis regulatory proteins are useful for treating angiogenesis disorders. Modulators
of angiogenesis regulatory proteins are tested using either recombinant or naturally occurring 
angiogenesis regulatory protein, preferably human angiogenesis regulatory protein.
Preferably, the angiogenesis regulatory proteins will have the sequence
displayed in SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44, SEQ ID NO:58, SEQ ID
NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID NO:82, SEQ ID NO:87,
SEQ ID NO:90, SEQ ID NO:121, SEQ ID NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ
ID NO:164, SEQ ID NO:170, SEQ ID NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID
NO:184, SEQ ID NO:203, SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID
NO:319, SEQ ID NO:325, SEQ ID NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID
NO:366, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID
NO:420, SEQ ID NO:429, SEQ ID NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID
NO:457, SEQ ID NO:463, SEQ ID NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID
NO:497, SEQ ID NO:499, SEQ ID NO:520, SEQ ID NO:522, and SEQ ID NO:524 or a
conservatively modified variant thereof. Alternatively, the angiogenesis regulatory proteins
of the assay will be derived from a eukaryote and include an amino acid subsequence having
substantial amino acid sequence identity to SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44,
SEQ ID NO:58, SEQ ID NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID
NO:82, SEQ ID NO:87, SEQ ID NO:90, SEQ ID NO:121, SEQ ID NO:129, SEQ ID
NO:140, SEQ ID NO:154, SEQ ID NO:164, SEQ ID NO:170, SEQ ID NO:172, SEQ ID
NO:174, SEQ ID NO:176, SEQ ID NO:184, SEQ ID NO:203, SEQ ID NO:287, SEQ ID
NO:298, SEQ ID NO:309, SEQ ID NO:319, SEQ ID NO:325, SEQ ID NO:331, SEQ ID
NO:341, SEQ ID NO:352, SEQ ID NO:366, SEQ ID NO:378, SEQ ID NO:385, SEQ ID
NO:407, SEQ ID NO:409, SEQ ID NO:420, SEQ ID NO:429, SEQ ID NO:438, SEQ ID
NO:440, SEQ ID NO:446, SEQ ID NO:457, SEQ ID NO:463, SEQ ID NO:482, SEQ ID
NO:485, SEQ ID NO:494, SEQ ID NO:497, SEQ ID NO:499, SEQ ID NO:520, SEQ ID
NO:522, and SEQ ID NO:524. Generally, the amino acid sequence identity will be at least
60%, preferably at least 65%, 70%, 75%, 80%, 85%, or 90%, most preferably at least 95%.
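
As a non-limiting illustration of how a percent identity figure of this kind can be computed, the sketch below scores two sequences that have already been aligned. The function names, the toy sequences, and the choice to count every aligned column (including columns with a gap in one sequence) in the denominator are assumptions made for the example; alignment programs differ in how they define the denominator.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: columns where both sequences carry a gap are ignored, and
    all remaining columns count toward the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(identity_pct, minimum_pct=60.0):
    """True when the identity reaches the stated minimum (60% by default)."""
    return identity_pct >= minimum_pct


if __name__ == "__main__":
    # Toy, hypothetical sequences; not taken from any SEQ ID NO.
    identity = percent_identity("MKTAYIAK", "MKTAYLAK")
    print(f"{identity:.1f}% identical, passes 60%: {meets_threshold(identity)}")
```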

Measurement of an angiogenic or loss-of-angiogenesis phenotype in
angiogenesis regulatory proteins or cells expressing angiogenesis regulatory proteins, either
recombinant or naturally occurring, can be performed using a variety of assays, in vitro, in
vivo, and ex vivo. For example, recombinant or naturally occurring angiogenesis regulatory
protein can be used in vitro, in a ligand binding or enzymatic function assay. Angiogenesis
regulatory proteins present in a cellular extract can also be used in in vitro assays. Cell- and
animal-based in vivo assays can also be used to assay for angiogenesis regulatory protein
modulators. Any suitable physical, chemical, or phenotypic change that affects activity or
binding can be used to assess the influence of a test compound on the polypeptide of this
invention. When the functional effects are determined using intact cells or animals, one can
also measure a variety of effects such as, in the case of angiogenesis associated with tumors,
tumor growth, neovascularization, cell surface markers such as αvβ3, hormone release,
transcriptional changes to both known and uncharacterized genetic markers (e.g., northern
blots), changes in cell metabolism such as cell growth or pH changes, and changes in
intracellular second messengers such as cGMP. In one embodiment, measurement of αvβ3
integrin cell surface expression and FACS sorting is used to identify modulators of
angiogenesis.



In vitro assays

Assays to identify compounds with angiogenesis regulatory protein
modulating activity, e.g., anti-angiogenic activity, can be performed in vitro, e.g., binding
assays. Such assays can use full length angiogenesis regulatory proteins or a variant thereof
(see, e.g., SEQ ID NO:2), or a fragment of an angiogenesis regulatory protein having a
desired activity. Purified recombinant or naturally occurring angiogenesis regulatory proteins
can be used in the in vitro methods of the invention. In addition to purified angiogenesis
regulatory proteins, the recombinant or naturally occurring angiogenesis regulatory proteins
can be part of a cellular lysate. As described below, the assay can be either solid state or
soluble. Preferably, the protein is bound to a solid support, either covalently or
non-covalently. Often, the in vitro assays of the invention are ligand binding or ligand affinity
assays, either non-competitive or competitive. Other in vitro assays include measuring
changes in spectroscopic (e.g., fluorescence, absorbance, refractive index), hydrodynamic
(e.g., shape), chromatographic, or solubility properties for the protein.

In one embodiment, a high throughput binding assay is performed in which
the angiogenesis regulatory protein or chimera comprising a fragment thereof is contacted
with a potential modulator and incubated for a suitable amount of time. In one embodiment,
the potential modulator is bound to a solid support, and the angiogenesis regulatory protein
is added. In another embodiment, the angiogenesis regulatory protein is bound to a solid
support. A wide variety of modulators can be used, as described below, including small
organic molecules, peptides, and antibodies. A wide variety of assays can be used to identify
angiogenesis regulatory protein-modulator binding or enzymatic activity, including labeled
protein-protein binding assays, electrophoretic mobility shifts, immunoassays, and the like.
In some cases, the binding of the candidate modulator is determined through the use of
competitive binding assays, where interference with binding of a known ligand is measured
in the presence of a potential modulator. Often, either the potential modulator or the known
ligand is labeled. Binding of the candidate modulator can also be assessed by determining its
effect on the enzymatic activity of an angiogenesis regulatory protein.
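
For a competitive binding assay in which the known ligand is labeled, interference by a candidate modulator is commonly summarized as percent inhibition of specific binding. The sketch below illustrates one such calculation; the function name, the use of a nonspecific-binding control, and the example readings are assumptions for illustration and are not prescribed by the assays described above.

```python
def percent_inhibition(signal, total_binding, nonspecific_binding):
    """Percent inhibition of specific binding of a labeled known ligand.

    signal:              reading with the candidate modulator present
    total_binding:       reading with labeled ligand alone (no modulator)
    nonspecific_binding: reading with excess unlabeled ligand (background)
    """
    window = total_binding - nonspecific_binding
    if window <= 0:
        raise ValueError("total binding must exceed nonspecific binding")
    specific_fraction = (signal - nonspecific_binding) / window
    return 100.0 * (1.0 - specific_fraction)


if __name__ == "__main__":
    # Hypothetical readings in arbitrary units.
    inhibition = percent_inhibition(signal=600, total_binding=1000,
                                    nonspecific_binding=200)
    print(f"{inhibition:.0f}% inhibition of specific binding")
```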

Cell-based in vivo assays 

In another embodiment, an angiogenesis regulatory protein is expressed in a
cell, and functional, e.g., physical and chemical or phenotypic, changes are assayed to 
identify angiogenesis modulators, preferably anti-angiogenesis compounds. Cells expressing 
angiogenesis regulatory proteins can also be used in binding assays or enzymatic assays. 
Any suitable functional effect can be measured, as described herein. For example, ligand 
binding, cell surface marker expression, cellular proliferation, cellular differentiation assays 
and cell migration assays are all suitable assays to identify potential modulators using a cell 
based system. Suitable cells for such cell based assays include both primary endothelial cells 
and cell lines, as described herein. The angiogenesis regulatory proteins can be naturally 
occurring or recombinant. Also, as described above, a fragment of an angiogenesis 
regulatory protein with a desired activity can be used in cell based assays. 



As described above, in one embodiment, loss-of-angiogenesis phenotype is
measured by contacting endothelial cells comprising an angiogenesis regulatory protein target
with a potential modulator. Modulation of angiogenesis is identified by screening for cell
surface marker expression, e.g., integrin expression levels, using fluorescent antibodies
and FACS sorting.

In another embodiment, cellular angiogenesis regulatory protein levels are
determined by measuring the level of protein or mRNA. The level of angiogenesis regulatory
protein or proteins is measured using immunoassays such as western blotting, ELISA and
the like with an antibody that selectively binds to the angiogenesis regulatory protein
polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using
PCR, LCR, or hybridization assays, e.g., northern hybridization, RNAse protection, dot
blotting, are preferred. The level of protein or mRNA is detected using directly or indirectly
labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids,
radioactively or enzymatically labeled antibodies, and the like, as described herein.



Alternatively, angiogenesis regulatory protein expression can be measured
using a reporter gene system. Such a system can be devised using an angiogenesis regulatory
protein promoter operably linked to a reporter gene such as chloramphenicol
acetyltransferase, firefly luciferase, bacterial luciferase, β-galactosidase and alkaline
phosphatase. Furthermore, the protein of interest can be used as an indirect reporter via
attachment to a second reporter such as red or green fluorescent protein (see, e.g., Misteli &
Spector, Nature Biotechnology 15:961-964 (1997)). The reporter construct is typically
transfected into a cell. After treatment with a potential modulator, the amount of reporter
gene transcription, translation, or activity is measured according to standard techniques
known to those of skill in the art.

In another embodiment, angiogenesis regulatory protein enzymatic activity
can be measured using, e.g., labeled substrate proteins, gel electrophoresis, ELISA
assays, and other appropriate measures of enzymatic activity.

A variety of phenotypic angiogenesis assays are known to those of skill in the
art. Various models have been employed to evaluate angiogenesis (e.g., Croix et al., Science
289:1197-1202 (2000) and Kahn et al., Amer. J. Pathol. 156:1887-1900). Assessment of
angiogenesis in the presence of a potential modulator of angiogenesis can be performed using
cell-culture-based angiogenesis assays, e.g., endothelial cell tube formation assays, cellular
differentiation assays using matrigel matrix or by co-culture with smooth muscle cells,
chemotaxis assays using VEGF or FGF, and haptotaxis assays, as well as other animal based
bioassays such as the chick CAM assay, the mouse corneal assay, and assays measuring the
effect of administering potential modulators on implanted tumors.

Animal models 

A number of animal based assays for angiogenesis phenotypes are known to
those of skill in the art and can be used to assay for modulators of angiogenesis. For
example, the chick CAM assay is described by O'Reilly et al., Cell 79:315-328 (1994).
Briefly, 3 day old chicken embryos with intact yolks are separated from the egg and placed in
a petri dish. After 3 days of incubation, a methylcellulose disc containing the protein to be
tested is applied to the CAM of individual embryos. After about 48 hours of incubation, the
embryos and CAMs are observed to determine whether endothelial growth has been
inhibited.

The mouse corneal assay involves implanting a growth factor-containing
pellet, along with another pellet containing the suspected endothelial growth inhibitor, in the
cornea of a mouse and observing the pattern of capillaries that are elaborated in the cornea.

Angiogenesis can also be measured by determining the extent of
neovascularization of a tumor. For example, carcinoma cells can be subcutaneously
inoculated into athymic nude mice and tumor growth then monitored. Immunoassays using
endothelial cell-specific antibodies are typically used to stain for vascularization of the tumor
and to determine the number of vessels in the tumor.

As described above, animal models of angiogenesis find use in screening for
modulators of angiogenesis. Similarly, transgenic animal technology including gene
knockout technology, for example as a result of homologous recombination with an
appropriate gene targeting vector, or gene overexpression, will result in the absence or
increased expression of the angiogenesis regulatory proteins. The same technology can also
be applied to make knock-out cells. When desired, tissue-specific expression or knockout of
the gene encoding an angiogenesis regulatory protein may be necessary. Transgenic animals
generated by such methods find use as animal models of angiogenesis and are additionally
useful in screening for modulators of angiogenesis.

Knock-out cells and transgenic mice can be made by insertion of a marker
gene or other heterologous gene into the endogenous gene site encoding an angiogenesis
regulatory protein in the mouse genome via homologous recombination. Such mice can also
be made by substituting the endogenous angiogenesis regulatory protein with a mutated
version of angiogenesis regulatory protein, or by mutating the endogenous angiogenesis
regulatory protein, e.g., by exposure to carcinogens.

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells
containing the newly engineered genetic lesion are injected into a host mouse embryo, which
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic
lesion (see, e.g., Capecchi et al., Science 244:1288 (1989)). Chimeric targeted mice can be
derived according to Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual,
Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987).



B. Modulators 

The compounds tested as modulators of angiogenesis regulatory proteins can
be any small organic molecule, or a biological entity, such as a protein, e.g., an antibody or
peptide, a sugar, a nucleic acid, e.g., an antisense oligonucleotide, RNAi, or a ribozyme, or a
lipid. Alternatively, modulators can be genetically altered versions of an angiogenesis
regulatory protein. Typically, test compounds will be small organic molecules, peptides,
lipids, and lipid analogs.

Essentially any chemical compound can be used as a potential modulator or
ligand in the assays of the invention, although most often compounds that can be dissolved in
aqueous or organic (especially DMSO-based) solutions are used. The assays are designed to
screen large chemical libraries by automating the assay steps and providing compounds from
any convenient source to assays, which are typically run in parallel (e.g., in microtiter formats
on microtiter plates in robotic assays). It will be appreciated that there are many suppliers of
chemical compounds, including Sigma (St. Louis, MO), Aldrich (St. Louis, MO),
Sigma-Aldrich (St. Louis, MO), Fluka Chemika-Biochemica Analytika (Buchs, Switzerland) and the
like.

In one preferred embodiment, high throughput screening methods involve
providing a combinatorial small organic molecule or peptide library containing a large
number of potential therapeutic compounds (potential modulator or ligand compounds).
Such "combinatorial chemical libraries" or "ligand libraries" are then screened in one or
more assays, as described herein, to identify those library members (particular chemical
species or subclasses) that display a desired characteristic activity. The compounds thus
identified can serve as conventional "lead compounds" or can themselves be used as potential
or actual therapeutics.

A combinatorial chemical library is a collection of diverse chemical
compounds generated by either chemical synthesis or biological synthesis, by combining a
number of chemical "building blocks" such as reagents. For example, a linear combinatorial
chemical library such as a polypeptide library is formed by combining a set of chemical
building blocks (amino acids) in every possible way for a given compound length (i.e., the
number of amino acids in a polypeptide compound). Millions of chemical compounds can be
synthesized through such combinatorial mixing of chemical building blocks.
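
The arithmetic behind such a linear library can be illustrated with a short sketch. With the 20 standard amino acids as building blocks, a library of peptides of length n contains 20^n members, so a pentapeptide library already holds 3,200,000 sequences. The function names below are hypothetical, and the enumeration is shown only for small cases, since the number of members grows exponentially with length.

```python
from itertools import product

# One-letter codes for the 20 standard amino acids (the building blocks).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(n_blocks, length):
    """Number of linear compounds of a given length from n_blocks building blocks."""
    return n_blocks ** length


def enumerate_library(blocks, length):
    """Lazily yield every linear combination of the building blocks."""
    for combo in product(blocks, repeat=length):
        yield "".join(combo)


if __name__ == "__main__":
    print(library_size(len(AMINO_ACIDS), 5))       # size of a pentapeptide library
    print(list(enumerate_library("AG", 2)))        # tiny two-block example
```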

Preparation and screening of combinatorial chemical libraries is well known to
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to,
peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka, Int. J. Pept. Prot. Res. 37:487-493
(1991) and Houghton et al., Nature 354:84-88 (1991)). Other chemistries for generating
chemical diversity libraries can also be used. Such chemistries include, but are not limited to:
peptoids (e.g., PCT Publication No. WO 91/19735), encoded peptides (e.g., PCT Publication
No. WO 93/20242), random bio-oligomers (e.g., PCT Publication No. WO 92/00091),
benzodiazepines (e.g., U.S. Pat. No. 5,288,514), diversomers such as hydantoins,
benzodiazepines and dipeptides (Hobbs et al., Proc. Nat. Acad. Sci. USA 90:6909-6913
(1993)), vinylogous polypeptides (Hagihara et al., J. Amer. Chem. Soc. 114:6568 (1992)),
nonpeptidal peptidomimetics with glucose scaffolding (Hirschmann et al., J. Amer. Chem.
Soc. 114:9217-9218 (1992)), analogous organic syntheses of small compound libraries (Chen
et al., J. Amer. Chem. Soc. 116:2661 (1994)), oligocarbamates (Cho et al., Science 261:1303
(1993)), and/or peptidyl phosphonates (Campbell et al., J. Org. Chem. 59:658 (1994)),
nucleic acid libraries (see Ausubel, Berger and Sambrook, all supra), peptide nucleic acid
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al., Nature
Biotechnology, 14(3):309-314 (1996) and PCT/US96/10287), carbohydrate libraries (see,
e.g., Liang et al., Science, 274:1520-1522 (1996) and U.S. Patent 5,593,853), small organic
molecule libraries (see, e.g., benzodiazepines, Baum C&EN, Jan 18, page 33 (1993);
isoprenoids, U.S. Patent 5,569,588; thiazolidinones and metathiazanones, U.S. Patent
5,549,974; pyrrolidines, U.S. Patents 5,525,735 and 5,519,134; morpholino compounds, U.S.
Patent 5,506,337; benzodiazepines, 5,288,514, and the like).

Devices for the preparation of combinatorial libraries are commercially
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony,
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore,
Bedford, MA). In addition, numerous combinatorial libraries are themselves commercially
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, RU, Tripos, Inc., St. Louis,
MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences,
Columbia, MD, etc.).



C. Solid State and soluble high throughput assays 

In one embodiment the invention provides soluble assays using angiogenesis
regulatory proteins, or a cell or tissue expressing angiogenesis regulatory proteins, either
naturally occurring or recombinant. In another embodiment, the invention provides solid
phase based in vitro assays in a high throughput format, where the angiogenesis regulatory
protein is attached to a solid phase substrate. Any one of the assays described herein can be
adapted for high throughput screening, e.g., ligand binding, cellular proliferation, cell surface
marker flux, e.g., αvβ3 integrin, enzymatic activity, etc. In one preferred embodiment, the
cell-based system using αvβ3 integrin modulation and FACS assays is used in a high
throughput format for identifying modulators of angiogenesis regulatory proteins, and
therefore modulators of T cell angiogenesis.




In the high throughput assays of the invention, either soluble or solid state, it is
possible to screen up to several thousand different modulators or ligands in a single day. This
methodology can be used for angiogenesis regulatory proteins in vitro, or for cell-based
assays comprising an angiogenesis regulatory protein. In particular, each well of a
microtiter plate can be used to run a separate assay against a selected potential modulator, or,
if concentration or incubation time effects are to be observed, every 5-10 wells can test a
single modulator. Thus, a single standard microtiter plate can assay about 100 (e.g., 96)
modulators. If 1536 well plates are used, then a single plate can easily assay from about 100
to about 1500 different compounds. It is possible to assay many plates per day; assay screens
for up to about 6,000, 20,000, 50,000, or more than 100,000 different compounds are possible
using the integrated systems of the invention.
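
The plate arithmetic above can be checked with a brief sketch. The function names are hypothetical, and the calculation assumes that every well is available for test compounds, i.e., it does not reserve wells for controls, which a working screen would normally do.

```python
import math


def compounds_per_plate(wells, wells_per_compound=1):
    """Number of modulators one plate can test.

    wells_per_compound > 1 models the case where several wells are used per
    modulator to observe concentration or incubation time effects.
    """
    if wells <= 0 or wells_per_compound <= 0:
        raise ValueError("wells and wells_per_compound must be positive")
    return wells // wells_per_compound


def plates_needed(n_compounds, wells, wells_per_compound=1):
    """Minimum number of plates needed to screen n_compounds."""
    per_plate = compounds_per_plate(wells, wells_per_compound)
    if per_plate == 0:
        raise ValueError("plate is too small for this layout")
    return math.ceil(n_compounds / per_plate)


if __name__ == "__main__":
    print(compounds_per_plate(96))             # one well per modulator
    print(compounds_per_plate(1536, 10))       # ten wells per modulator
    print(plates_needed(100_000, 1536))        # plates for a 100,000 compound screen
```

With ten wells per modulator a 1536 well plate tests 153 compounds, and with one well per modulator it tests 1536, which matches the range of about 100 to about 1500 stated above.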

For a solid state reaction, the protein of interest or a fragment thereof, e.g., an
extracellular domain, or a cell comprising the protein of interest or a fragment thereof as part
of a fusion protein can be bound to the solid state component, directly or indirectly, via
covalent or non-covalent linkage, e.g., via a tag. The tag can be any of a variety of
components. In general, a molecule which binds the tag (a tag binder) is fixed to a solid
support, and the tagged molecule of interest is attached to the solid support by interaction of
the tag and the tag binder.

A number of tags and tag binders can be used, based upon known molecular
interactions well described in the literature. For example, where a tag has a natural binder,
for example, biotin, protein A, or protein G, it can be used in conjunction with appropriate tag
binders (avidin, streptavidin, neutravidin, the Fc region of an immunoglobulin, etc.).
Antibodies to molecules with natural binders such as biotin are also widely available and
are appropriate tag binders (see SIGMA Immunochemicals 1998 catalogue, SIGMA, St. Louis,
MO).

Similarly, any haptenic or antigenic compound can be used in combination
with an appropriate antibody to form a tag/tag binder pair. Thousands of specific antibodies
are commercially available and many additional antibodies are described in the literature.
For example, in one common configuration, the tag is a first antibody and the tag binder is a
second antibody which recognizes the first antibody. In addition to antibody-antigen
interactions, receptor-ligand interactions are also appropriate as tag and tag-binder pairs. For
example, agonists and antagonists of cell membrane receptors (e.g., cell receptor-ligand
interactions such as transferrin, c-kit, viral receptor ligands, cytokine receptors, chemokine
receptors, interleukin receptors, immunoglobulin receptors and antibodies, the cadherin
family, the integrin family, the selectin family, and the like; see, e.g., Pigott & Power, The
Adhesion Molecule Facts Book I (1993)). Similarly, toxins and venoms, viral epitopes,
hormones (e.g., opiates, steroids, etc.), intracellular receptors (e.g., which mediate the effects
of various small ligands, including steroids, thyroid hormone, retinoids and vitamin D;
peptides), drugs, lectins, sugars, nucleic acids (both linear and cyclic polymer
configurations), oligosaccharides, proteins, phospholipids and antibodies can all interact with
various cell receptors.



Synthetic polymers, such as polyurethanes, polyesters, polycarbonates,
polyureas, polyamides, polyethyleneimines, polyarylene sulfides, polysiloxanes, polyimides,
and polyacetates can also form an appropriate tag or tag binder. Many other tag/tag binder
pairs are also useful in assay systems described herein, as would be apparent to one of skill
upon review of this disclosure.

Common linkers such as peptides, polyethers, and the like can also serve as
tags, and include polypeptide sequences, such as poly gly sequences of between about 5 and
200 amino acids. Such flexible linkers are known to persons of skill in the art. For example,
poly(ethylene glycol) linkers are available from Shearwater Polymers, Inc., Huntsville,
Alabama. These linkers optionally have amide linkages, sulfhydryl linkages, or
heterofunctional linkages.

Tag binders are fixed to solid substrates using any of a variety of methods
currently available. Solid substrates are commonly derivatized or functionalized by exposing
all or a portion of the substrate to a chemical reagent which fixes a chemical group to the
surface which is reactive with a portion of the tag binder. For example, groups which are
suitable for attachment to a longer chain portion would include amines, hydroxyl, thiol, and
carboxyl groups. Aminoalkylsilanes and hydroxyalkylsilanes can be used to functionalize a
variety of surfaces, such as glass surfaces. The construction of such solid phase biopolymer
arrays is well described in the literature. See, e.g., Merrifield, J. Am. Chem. Soc. 85:2149-2154
(1963) (describing solid phase synthesis of, e.g., peptides); Geysen et al., J. Immun.
Meth. 102:259-274 (1987) (describing synthesis of solid phase components on pins); Frank &
Doring, Tetrahedron 44:6031-6040 (1988) (describing synthesis of various peptide sequences
on cellulose disks); Fodor et al., Science, 251:767-777 (1991); Sheldon et al., Clinical
Chemistry 39(4):718-719 (1993); and Kozal et al., Nature Medicine 2(7):753-759 (1996) (all
describing arrays of biopolymers fixed to solid substrates). Non-chemical approaches for
fixing tag binders to substrates include other common methods, such as heat, cross-linking by
UV radiation, and the like.




Antibodies to angiogenesis regulatory proteins 

In addition to the detection of genes encoding angiogenesis regulatory proteins
and gene expression using nucleic acid hybridization technology, one can also use
immunoassays to detect angiogenesis regulatory proteins of the invention. Such assays are
useful for screening for modulators of angiogenesis regulatory proteins and angiogenesis, as
well as for therapeutic and diagnostic applications. Immunoassays can be used qualitatively
or quantitatively to analyze angiogenesis regulatory proteins. A general overview of the
applicable technology can be found in Harlow & Lane, Antibodies: A Laboratory Manual
(1988).

A. Production of antibodies 

Methods of producing polyclonal and monoclonal antibodies that react
specifically with the angiogenesis regulatory proteins are known to those of skill in the art
(see, e.g., Coligan, Current Protocols in Immunology (1991); Harlow & Lane, supra; Goding,
Monoclonal Antibodies: Principles and Practice (2d ed. 1986); and Kohler & Milstein,
Nature 256:495-497 (1975)). Such techniques include antibody preparation by selection of
antibodies from libraries of recombinant antibodies in phage or similar vectors, as well as
preparation of polyclonal and monoclonal antibodies by immunizing rabbits or mice (see,
e.g., Huse et al., Science 246:1275-1281 (1989); Ward et al., Nature 341:544-546 (1989)).

A number of immunogens comprising portions of angiogenesis regulatory
proteins may be used to produce antibodies specifically reactive with angiogenesis regulatory
proteins. For example, recombinant angiogenesis regulatory proteins or an antigenic
fragment thereof can be isolated as described herein. Recombinant protein can be expressed
in eukaryotic or prokaryotic cells as described above, and purified as generally described
above. Recombinant protein is the preferred immunogen for the production of monoclonal or
polyclonal antibodies. Alternatively, a synthetic peptide derived from the sequences
disclosed herein and conjugated to a carrier protein can be used as an immunogen. Naturally
occurring protein may also be used either in pure or impure form. The product is then
injected into an animal capable of producing antibodies. Either monoclonal or polyclonal
antibodies may be generated, for subsequent use in immunoassays to measure the protein.

Methods of production of polyclonal antibodies are known to those of skill in
the art. An inbred strain of mice (e.g., BALB/C mice) or rabbits is immunized with the
protein using a standard adjuvant, such as Freund's adjuvant, and a standard immunization
protocol. The animal's immune response to the immunogen preparation is monitored by
taking test bleeds and determining the titer of reactivity to the beta subunits. When
appropriately high titers of antibody to the immunogen are obtained, blood is collected from
the animal and antisera are prepared. Further fractionation of the antisera to enrich for
antibodies reactive to the protein can be done if desired (see, Harlow & Lane, supra).

Monoclonal antibodies may be obtained by various techniques familiar to
those skilled in the art. Briefly, spleen cells from an animal immunized with a desired
antigen are immortalized, commonly by fusion with a myeloma cell (see, Kohler & Milstein,
Eur. J. Immunol. 6:511-519 (1976)). Alternative methods of immortalization include
transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other methods well
known in the art. Colonies arising from single immortalized cells are screened for production
of antibodies of the desired specificity and affinity for the antigen, and yield of the
monoclonal antibodies produced by such cells may be enhanced by various techniques,
including injection into the peritoneal cavity of a vertebrate host. Alternatively, one may
isolate DNA sequences which encode a monoclonal antibody or a binding fragment thereof
by screening a DNA library from human B cells according to the general protocol outlined by
Huse et al., Science 246:1275-1281 (1989).

Monoclonal antibodies and polyclonal sera are collected and titered against the
immunogen protein in an immunoassay, for example, a solid phase immunoassay with the
immunogen immobilized on a solid support. Typically, polyclonal antisera with a titer of 10^4
or greater are selected and tested for their cross reactivity against non-angiogenesis regulatory
proteins, using a competitive binding immunoassay. Specific polyclonal antisera and
monoclonal antibodies will usually bind with a Kd of at least about 0.1 mM, more usually at
least about 1 μM, preferably at least about 0.1 μM or better, and most preferably, 0.01 μM or
better. Antibodies specific only for a particular angiogenesis regulatory protein ortholog,
such as human angiogenesis regulatory protein, can also be made, by subtracting out other
cross-reacting orthologs from a species such as a non-human mammal. In this manner,
antibodies that bind only to angiogenesis regulatory proteins may be obtained.
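
The selection criteria in the preceding paragraph can be expressed as a simple screen, sketched below for illustration. The function names and tier labels are hypothetical, and the sketch assumes that binding strength is reported as a dissociation constant in molar units, so that a smaller value indicates tighter binding.

```python
def passes_titer(titer, minimum=1e4):
    """True when a polyclonal antiserum reaches the minimum titer (10^4 by default)."""
    return titer >= minimum


def classify_kd(kd_molar):
    """Map a dissociation constant (in mol/L) onto the affinity tiers stated above.

    Thresholds: 0.1 mM (usual), 1 uM (more usual), 0.1 uM (preferred),
    0.01 uM (most preferred). Lower values mean tighter binding.
    """
    if kd_molar <= 0:
        raise ValueError("dissociation constant must be positive")
    if kd_molar <= 1e-8:
        return "most preferred"
    if kd_molar <= 1e-7:
        return "preferred"
    if kd_molar <= 1e-6:
        return "more usual"
    if kd_molar <= 1e-4:
        return "usual"
    return "outside stated range"


if __name__ == "__main__":
    # Hypothetical antiserum with a titer of 20,000 and a 5 nM antibody.
    print(passes_titer(2e4), classify_kd(5e-9))
```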

Once the specific antibodies against specific angiogenesis regulatory proteins
are available, the proteins can be detected by a variety of immunoassay methods. In addition,
the antibodies can be used therapeutically as angiogenesis regulatory protein modulators.
For a review of immunological and immunoassay procedures, see Basic and Clinical
Immunology (Stites & Terr eds., 7th ed. 1991). Moreover, the immunoassays of the present
invention can be performed in any of several configurations, which are reviewed extensively
in Enzyme Immunoassay (Maggio, ed., 1980); and Harlow & Lane, supra.

B. Immunological binding assays 
Angiogenesis regulatory proteins can be detected and/or quantified using any
of a number of well recognized immunological binding assays (see, e.g., U.S. Patents
4,366,241; 4,376,110; 4,517,288; and 4,837,168). For a review of the general immunoassays,
see also Methods in Cell Biology: Antibodies in Cell Biology, volume 37 (Asai, ed. 1993);
Basic and Clinical Immunology (Stites & Terr, eds., 7th ed. 1991). Immunological binding
assays (or immunoassays) typically use an antibody that specifically binds to a protein or
antigen of choice (in this case the angiogenesis regulatory proteins or antigenic subsequence
thereof). The antibody (e.g., anti-angiogenesis regulatory protein) may be produced by any
of a number of means well known to those of skill in the art and as described above.

Immunoassays also often use a labeling agent to specifically bind to and label
the complex formed by the antibody and antigen. The labeling agent may itself be one of the
moieties comprising the antibody/antigen complex. Thus, the labeling agent may be a
labeled angiogenesis regulatory protein or a labeled anti-angiogenesis regulatory protein
antibody. Alternatively, the labeling agent may be a third moiety, such as a secondary antibody,
that specifically binds to the antibody/angiogenesis regulatory protein complex (a secondary
antibody is typically specific to antibodies of the species from which the first antibody is
derived). Other proteins capable of specifically binding immunoglobulin constant regions,
such as protein A or protein G, may also be used as the labeling agent. These proteins exhibit a
strong non-immunogenic reactivity with immunoglobulin constant regions from a variety of
species (see, e.g., Kronval et al., J. Immunol. 111:1401-1406 (1973); Akerstrom et al., J.
Immunol. 135:2589-2542 (1985)). The labeling agent can be modified with a detectable
moiety, such as biotin, to which another molecule can specifically bind, such as streptavidin.
A variety of detectable moieties are well known to those skilled in the art.

Throughout the assays, incubation and/or washing steps may be required after
each combination of reagents. Incubation steps can vary from about 5 seconds to several
hours, optionally from about 5 minutes to about 24 hours. However, the incubation time will
depend upon the assay format, antigen, volume of solution, concentrations, and the like.
Usually, the assays will be carried out at ambient temperature, although they can be
conducted over a range of temperatures, such as 10°C to 40°C.

Non-competitive assay formats

Immunoassays for detecting angiogenesis regulatory proteins in samples may
be either competitive or noncompetitive. Noncompetitive immunoassays are assays in which
the amount of antigen is directly measured. In one preferred "sandwich" assay, for example,
the anti-angiogenesis regulatory protein antibodies can be bound directly to a solid substrate
on which they are immobilized. These immobilized antibodies then capture angiogenesis
regulatory protein present in the test sample. Angiogenesis regulatory proteins thus
immobilized are then bound by a labeling agent, such as a second anti-angiogenesis regulatory
protein antibody bearing a label. Alternatively, the second antibody may lack a label, but it
may, in turn, be bound by a labeled third antibody specific to antibodies of the species from
which the second antibody is derived. The second or third antibody is typically modified
with a detectable moiety, such as biotin, to which another molecule specifically binds, e.g.,
streptavidin, to provide a detectable moiety.

Competitive assay formats 

In competitive assays, the amount of angiogenesis regulatory protein present
in the sample is measured indirectly by measuring the amount of a known, added
(exogenous) angiogenesis regulatory protein displaced (competed away) from an
anti-angiogenesis regulatory protein antibody by the unknown angiogenesis regulatory protein
present in a sample. In one competitive assay, a known amount of angiogenesis regulatory
protein is added to a sample and the sample is then contacted with an antibody that
specifically binds to angiogenesis regulatory proteins. The amount of exogenous
angiogenesis regulatory protein bound to the antibody is inversely proportional to the
concentration of angiogenesis regulatory protein present in the sample. In a particularly
preferred embodiment, the antibody is immobilized on a solid substrate. The amount of
angiogenesis regulatory protein bound to the antibody may be determined either by
measuring the amount of angiogenesis regulatory protein present in the angiogenesis regulatory
protein/antibody complex, or alternatively by measuring the amount of remaining
uncomplexed protein. The amount of angiogenesis regulatory protein may be detected by
providing a labeled angiogenesis regulatory protein molecule.
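
The inverse relationship described above can be illustrated with a short numerical sketch. It assumes a simple one-site competition model in which the fraction of labeled (exogenous) protein bound equals 1 / (1 + C / IC50), where C is the concentration of unlabeled protein in the sample. This model, the function names, and the parameter values are illustrative assumptions and are not taken from the specification; a real assay would normally be calibrated against a measured standard curve.

```python
def bound_fraction(sample_conc, ic50):
    """Fraction of labeled protein still bound to the antibody.

    Assumes one-site competition: the signal halves when the unlabeled
    sample protein reaches the (hypothetical) IC50 concentration.
    """
    if sample_conc < 0 or ic50 <= 0:
        raise ValueError("concentrations must be non-negative and ic50 positive")
    return 1.0 / (1.0 + sample_conc / ic50)


def estimate_concentration(bound_signal, max_signal, ic50):
    """Invert the model: estimate sample concentration from bound label.

    max_signal is the signal measured with no unlabeled competitor present.
    """
    if not 0 < bound_signal <= max_signal:
        raise ValueError("bound_signal must lie in (0, max_signal]")
    return ic50 * (max_signal / bound_signal - 1.0)


if __name__ == "__main__":
    # More protein in the sample gives less bound label (inverse relationship).
    for conc in (0.0, 1.0, 5.0, 25.0):
        print(conc, round(bound_fraction(conc, ic50=5.0), 3))
```

Under this assumed model, a bound signal equal to half of the competitor-free signal corresponds to a sample concentration equal to the IC50.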

A hapten inhibition assay is another preferred competitive assay. In this assay
the known angiogenesis regulatory protein is immobilized on a solid substrate. A known
amount of anti-angiogenesis regulatory protein antibody is added to the sample, and the
sample is then contacted with the immobilized angiogenesis regulatory protein. The amount
of anti-angiogenesis regulatory protein antibody bound to the known immobilized
angiogenesis regulatory protein is inversely proportional to the amount of angiogenesis
regulatory protein present in the sample. Again, the amount of immobilized antibody may
be determined by detecting either the immobilized fraction of antibody or the fraction of the
antibody that remains in solution. Detection may be direct where the antibody is labeled or
indirect by the subsequent addition of a labeled moiety that specifically binds to the antibody
as described above.

Cross-reactivity determinations

Immunoassays in the competitive binding format can also be used for
cross-reactivity determinations. For example, angiogenesis regulatory proteins can be
immobilized to a solid support. Proteins (e.g., angiogenesis regulatory protein and homologs)
are added to the assay that compete for binding of the antisera to the immobilized antigen.
The ability of the added proteins to compete for binding of the antisera to the immobilized
protein is compared to the ability of the angiogenesis regulatory protein to compete with
itself. The percent cross-reactivity for the above proteins is calculated, using standard
calculations. Those antisera with less than 10% cross-reactivity with each of the added
proteins listed above are selected and pooled. The cross-reacting antibodies are optionally
removed from the pooled antisera by immunoabsorption with the added proteins under
consideration, e.g., distantly related homologs.

The immunoabsorbed and pooled antisera are then used in a competitive
binding immunoassay as described above to compare a second protein, thought to be perhaps
an allele or polymorphic variant of an angiogenesis regulatory protein, to the immunogen
protein. In order to make this comparison, the two proteins are each assayed at a wide range
of concentrations and the amount of each protein required to inhibit 50% of the binding of the
antisera to the immobilized protein is determined. If the amount of the second protein
required to inhibit 50% of binding is less than 10 times the amount of the angiogenesis
regulatory protein that is required to inhibit 50% of binding, then the second protein is said
to specifically bind to the polyclonal antibodies generated to the angiogenesis regulatory protein
immunogen.
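
The 50% inhibition comparison described above can be sketched in a few lines of Python. The sketch makes several assumptions that are not stated in the specification: each competition curve is given as (concentration, percent binding remaining) pairs, the 50% point is found by linear interpolation on a log concentration scale, and percent cross-reactivity is taken as the ratio of the two 50% inhibition amounts times 100, which is one common convention for the "standard calculations" mentioned. The function names and the example data are hypothetical.

```python
import math


def ic50(curve):
    """Concentration giving 50% binding, by log-linear interpolation.

    curve: iterable of (concentration, percent_binding) pairs, with
    binding decreasing as competitor concentration increases.
    """
    pts = sorted(curve)
    for (c_lo, b_lo), (c_hi, b_hi) in zip(pts, pts[1:]):
        if b_lo >= 50.0 >= b_hi and b_lo != b_hi:
            frac = (b_lo - 50.0) / (b_lo - b_hi)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    raise ValueError("curve does not cross 50% binding")


def percent_cross_reactivity(ic50_reference, ic50_competitor):
    """Assumed convention: 100 * IC50(reference) / IC50(competitor)."""
    return 100.0 * ic50_reference / ic50_competitor


def specifically_binds(ic50_reference, ic50_second, max_fold=10.0):
    """True if the second protein needs less than max_fold times the
    reference amount to inhibit 50% of binding."""
    return ic50_second < max_fold * ic50_reference


if __name__ == "__main__":
    reference = [(1.0, 90.0), (10.0, 50.0), (100.0, 10.0)]   # made-up data
    variant = [(1.0, 95.0), (10.0, 70.0), (100.0, 20.0)]     # made-up data
    ref50, var50 = ic50(reference), ic50(variant)
    print(ref50, var50, specifically_binds(ref50, var50))
```

With this convention, the 10-fold threshold for the second protein corresponds to 10% cross-reactivity, which matches the pooling cutoff used for the antisera.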
Other assay formats

Western blot (immunoblot) analysis is used to detect and quantify the presence
of an angiogenesis regulatory protein in the sample. The technique generally comprises
separating sample proteins by gel electrophoresis on the basis of molecular weight,
transferring the separated proteins to a suitable solid support (such as a nitrocellulose filter, a
nylon filter, or derivatized nylon filter), and incubating the sample with the antibodies that
specifically bind angiogenesis regulatory protein. The anti-angiogenesis regulatory protein
antibodies specifically bind to the angiogenesis regulatory protein on the solid support.
These antibodies may be directly labeled or alternatively may be subsequently detected using
labeled antibodies (e.g., labeled sheep anti-mouse antibodies) that specifically bind to the
anti-angiogenesis regulatory protein antibodies.

Other assay formats include liposome immunoassays (LIA), which use
liposomes designed to bind specific molecules (e.g., antibodies) and release encapsulated
reagents or markers. The released chemicals are then detected according to standard
techniques (see Monroe et al., Amer. Clin. Prod. Rev. 5:34-41 (1986)).



Reduction of non-specific binding 

One of skill in the art will appreciate that it is often desirable to minimize
non-specific binding in immunoassays. Particularly, where the assay involves an antigen or
antibody immobilized on a solid substrate, it is desirable to minimize the amount of
non-specific binding to the substrate. Means of reducing such non-specific binding are well
known to those of skill in the art. Typically, this technique involves coating the substrate
with a proteinaceous composition. In particular, protein compositions such as bovine serum
albumin (BSA), nonfat powdered milk, and gelatin are widely used, with powdered milk
being most preferred.

Labels 

The particular label or detectable group used in the assay is not a critical
aspect of the invention, as long as it does not significantly interfere with the specific binding
of the antibody used in the assay. The detectable group can be any material having a
detectable physical or chemical property. Such detectable labels have been well-developed in
the field of immunoassays and, in general, most any label useful in such methods can be
applied to the present invention. Thus, a label is any composition detectable by
spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical
means. Useful labels in the present invention include magnetic beads (e.g.,
DYNABEADS™), fluorescent dyes (e.g., fluorescein isothiocyanate, Texas red, rhodamine,
and the like), radiolabels (e.g., ³H, ¹²⁵I, ³⁵S, ¹⁴C, or ³²P), enzymes (e.g., horseradish
peroxidase, alkaline phosphatase and others commonly used in an ELISA), and colorimetric
labels such as colloidal gold or colored glass or plastic beads (e.g., polystyrene,
polypropylene, latex, etc.).

The label may be coupled directly or indirectly to the desired component of
the assay according to methods well known in the art. As indicated above, a wide variety of
labels may be used, with the choice of label depending on sensitivity required, ease of
conjugation with the compound, stability requirements, available instrumentation, and
disposal provisions.

Non-radioactive labels are often attached by indirect means. Generally, a
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds to
another molecule (e.g., streptavidin), which is either inherently detectable or
covalently bound to a signal system, such as a detectable enzyme, a fluorescent compound, or
a chemiluminescent compound. The ligands and their targets can be used in any suitable
combination with antibodies that recognize angiogenesis regulatory proteins, or secondary
antibodies that recognize anti-angiogenesis regulatory protein.

The molecules can also be conjugated directly to signal generating
compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of interest as
labels will primarily be hydrolases, particularly phosphatases, esterases and glycosidases, or
oxidoreductases, particularly peroxidases. Fluorescent compounds include fluorescein and its
derivatives, rhodamine and its derivatives, dansyl, umbelliferone, etc. Chemiluminescent
compounds include luciferin, and 2,3-dihydrophthalazinediones, e.g., luminol. For a review
of various labeling or signal producing systems that may be used, see U.S. Patent No.
4,391,904.

Means of detecting labels are well known to those of skill in the art. Thus, for
example, where the label is a radioactive label, means for detection include a scintillation
counter or photographic film as in autoradiography. Where the label is a fluorescent label, it
may be detected by exciting the fluorochrome with the appropriate wavelength of light and
detecting the resulting fluorescence. The fluorescence may be detected visually, or by the use
of electronic detectors such as charge coupled devices (CCDs) or photomultipliers and the
like. Similarly, enzymatic labels may be detected by providing the appropriate substrates for
the enzyme and detecting the resulting reaction product. Finally, simple colorimetric labels
may be detected simply by observing the color associated with the label. Thus, in various
dipstick assays, conjugated gold often appears pink, while various conjugated beads appear
the color of the bead.



Some assay formats do not require the use of labeled components. For
instance, agglutination assays can be used to detect the presence of the target antibodies. In
this case, antigen-coated particles are agglutinated by samples comprising the target
antibodies. In this format, none of the components need be labeled and the presence of the
target antibody is detected by simple visual inspection.

Gene therapy

The present invention provides the nucleic acids of angiogenesis regulatory
protein for the transfection of cells in vitro and in vivo. These nucleic acids can be inserted
into any of a number of well-known vectors for the transfection of target cells and organisms
as described below. The nucleic acids are transfected into cells, ex vivo or in vivo, through
the interaction of the vector and the target cell. The nucleic acid, under the control of a
promoter, then expresses the angiogenesis regulatory proteins of the present invention,
thereby mitigating the effects of absent, partial inactivation, or abnormal expression of the
angiogenesis regulatory protein gene, particularly as it relates to angiogenesis. The
compositions are administered to a patient in an amount sufficient to elicit a therapeutic
response in the patient. An amount adequate to accomplish this is defined as "therapeutically
effective dose or amount."

Such gene therapy procedures have been used to correct acquired and
inherited genetic defects, cancer, and other diseases in a number of contexts. The ability to
express artificial genes in humans facilitates the prevention and/or cure of many important
human diseases, including many diseases which are not amenable to treatment by other
therapies (for a review of gene therapy procedures, see Anderson, Science 256:808-813
(1992); Nabel & Felgner, TIBTECH 11:211-217 (1993); Mitani & Caskey, TIBTECH
11:162-166 (1993); Mulligan, Science 926-932 (1993); Dillon, TIBTECH 11:167-175
(1993); Miller, Nature 357:455-460 (1992); Van Brunt, Biotechnology 6(10):1149-1154
(1998); Vigne, Restorative Neurology and Neuroscience 8:35-36 (1995); Kremer &
Perricaudet, British Medical Bulletin 51(1):31-44 (1995); Haddada et al., in Current Topics
in Microbiology and Immunology (Doerfler & Bohm eds., 1995); and Yu et al., Gene
Therapy 1:13-26 (1994)).

The nucleic acids of the invention can also be used to make transgenic
animals, such as transgenic mice, either by knock-out or overexpression. Such animals are
useful, e.g., for testing modulators of angiogenesis.

Pharmaceutical compositions and administration 

Pharmaceutically acceptable carriers are determined in part by the particular 
composition being administered (e.g., nucleic acid, protein, modulatory compounds or 
transduced cell), as well as by the particular method used to administer the composition. 
Accordingly, there are a wide variety of suitable formulations of pharmaceutical 
compositions of the present invention (see, e.g., Remington's Pharmaceutical Sciences, 17th
ed., 1989). Administration can be in any convenient manner, e.g., by injection, oral
administration, inhalation, transdermal application, or rectal administration. 



Formulations suitable for oral administration can consist of (a) liquid
solutions, such as an effective amount of the packaged nucleic acid suspended in diluents,
such as water, saline or PEG 400; (b) capsules, sachets or tablets, each containing a
predetermined amount of the active ingredient, as liquids, solids, granules or gelatin; (c)
suspensions in an appropriate liquid; and (d) suitable emulsions. Tablet forms can include
one or more of lactose, sucrose, mannitol, sorbitol, calcium phosphates, corn starch, potato
starch, microcrystalline cellulose, gelatin, colloidal silicon dioxide, talc, magnesium stearate,
stearic acid, and other excipients, colorants, fillers, binders, diluents, buffering agents,
moistening agents, preservatives, flavoring agents, dyes, disintegrating agents, and
pharmaceutically compatible carriers. Lozenge forms can comprise the active ingredient in a
flavor, e.g., sucrose, as well as pastilles comprising the active ingredient in an inert base, such
as gelatin and glycerin or sucrose and acacia, emulsions, gels, and the like containing, in
addition to the active ingredient, carriers known in the art.

The compound of choice, alone or in combination with other suitable
components, can be made into aerosol formulations (i.e., they can be "nebulized") to be
administered via inhalation. Aerosol formulations can be placed into pressurized acceptable
propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like.

Formulations suitable for parenteral administration, such as, for example, by
intraarticular (in the joints), intravenous, intramuscular, intradermal, intraperitoneal, and
subcutaneous routes, include aqueous and non-aqueous, isotonic sterile injection solutions,
which can contain antioxidants, buffers, bacteriostats, and solutes that render the formulation
isotonic with the blood of the intended recipient, and aqueous and non-aqueous sterile
suspensions that can include suspending agents, solubilizers, thickening agents, stabilizers,
and preservatives. In the practice of this invention, compositions can be administered, for
example, by intravenous infusion, orally, topically, intraperitoneally, intravesically or
intrathecally. Parenteral administration and intravenous administration are the preferred
methods of administration. The formulations of compounds can be presented in unit-dose or
multi-dose sealed containers, such as ampules and vials.

Injection solutions and suspensions can be prepared from sterile powders, 
granules, and tablets of the kind previously described. Cells transduced by nucleic acids for 
ex vivo therapy can also be administered intravenously or parenterally as described above. 

The dose administered to a patient, in the context of the present invention,
should be sufficient to effect a beneficial therapeutic response in the patient over time. The
dose will be determined by the efficacy of the particular vector employed and the condition
of the patient, as well as the body weight or surface area of the patient to be treated. The size
of the dose also will be determined by the existence, nature, and extent of any adverse
side-effects that accompany the administration of a particular vector, or transduced cell type in a
particular patient.

In determining the effective amount of the vector to be administered in the
treatment or prophylaxis of conditions owing to diminished or aberrant expression of the
angiogenesis regulatory proteins, the physician evaluates circulating plasma levels of the
vector, vector toxicities, progression of the disease, and the production of anti-vector
antibodies. In general, the dose equivalent of a naked nucleic acid from a vector is from
about 1 µg to 100 µg for a typical 70 kilogram patient, and doses of vectors which include a
retroviral particle are calculated to yield an equivalent amount of therapeutic nucleic acid.

For administration, compounds and transduced cells of the present invention
can be administered at a rate determined by the LD-50 of the inhibitor, vector, or transduced
cell type, and the side-effects of the inhibitor, vector or cell type at various concentrations, as
applied to the mass and overall health of the patient. Administration can be accomplished via
single or divided doses.

EXAMPLES

Using a functional assay described herein (see also Figure 1), angiogenesis
regulatory proteins and nucleic acids have been identified and cloned from a green
fluorescent protein (GFP) fusion cDNA retroviral library. The cDNAs can be in-frame, out
of frame, sense, or antisense. Primary endothelial cells were transduced with retroviral
cDNA/GFP fusion libraries. Transduced cells were selected and assayed for alterations
consistent with modulation of angiogenesis, e.g., downregulation of cell surface expression of
αvβ3 or downregulation of haptotaxis.

Example 1: Identification of plakoglobin angiogenesis regulatory protein.

Using the functional assay described above, expression of a plakoglobin-GFP
fusion protein (i.e., GFP-ΔN-plakoglobin) was found to decrease levels of the cell surface
marker αvβ3. (See, e.g., Figures 2 and 3.) The plakoglobin functional hit was an N-terminal
deletion that contained armadillo repeats, fused in frame to the GFP protein. (See, e.g.,
Figure 3.) GFP-ΔN-Plakoglobin expression downregulated vitronectin receptors (αvβ3 and
αvβ5) but not the fibronectin receptor (α5β1). (See, e.g., Figure 4.)

The angiogenesis phenotype of GFP-ΔN-Plakoglobin was confirmed by
co-culturing HUVECs on smooth muscle cells and using a tubule formation assay. Briefly, on
day 1 a collagen solution (Collagen type I solution, Sigma C8919) was diluted 1 to 10 with
PBS and used to coat each well of a 24 well plate (250 µl). Plates were incubated at 37°C for
4-5 hrs. Wells were then aspirated and allowed to dry overnight. On day 2, stock flask(s) of
PASMCs (Clonetics Pulmonary Artery SMCs CC-2581, fewer than 5 passages) were
trypsinized to provide enough cells to plate 60,000 cells per well. Cells were cultured in
smooth muscle cell growth medium (Clonetics SmGM-2 bullet kit CC-3182) and stock
cultures were maintained according to Clonetics protocols.

PASMCs were nearly confluent on day 3. Stock flask(s) of
HUVECs were trypsinized to generate enough cells to plate 12,000 cells per well (trypsin,
Clonetics ReagentPack CC-5034). The SmGM-2 medium was removed and HUVECs were
plated onto the PASMCs in EGM-2 (Endothelial cell growth medium, Clonetics EGM-2
bullet kit CC-3162) at 1 ml per well.
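
The per-well quantities in the co-culture protocol above scale directly with the number of wells. The following sketch totals them for a plate. It assumes that "diluted 1 to 10" means one part collagen stock in ten parts final volume and that no overage is prepared; both are assumptions rather than statements from the protocol, and the function and key names are hypothetical.

```python
COLLAGEN_UL_PER_WELL = 250   # coating volume per well (from the protocol)
PASMC_PER_WELL = 60_000      # smooth muscle cells plated per well on day 2
HUVEC_PER_WELL = 12_000      # endothelial cells plated per well on day 3
EGM2_ML_PER_WELL = 1.0       # EGM-2 medium volume per well


def plate_requirements(wells=24, dilution_factor=10):
    """Total reagent volumes and cell numbers for one co-culture plate.

    dilution_factor=10 assumes 1 part collagen stock in 10 parts total.
    """
    coating_ul = wells * COLLAGEN_UL_PER_WELL
    stock_ul = coating_ul / dilution_factor
    return {
        "coating_solution_ul": coating_ul,
        "collagen_stock_ul": stock_ul,
        "pbs_ul": coating_ul - stock_ul,
        "pasmc_cells": wells * PASMC_PER_WELL,
        "huvec_cells": wells * HUVEC_PER_WELL,
        "egm2_ml": wells * EGM2_ML_PER_WELL,
    }


if __name__ == "__main__":
    for name, amount in plate_requirements().items():
        print(f"{name}: {amount}")
```

In practice a small excess of each reagent would usually be prepared to allow for pipetting losses.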

Medium was exchanged every 2-3 days and tubule formation was assessed.
Using the tubule formation assay, GFP-ΔN-Plakoglobin expression was found to inhibit
branch formation. (Data not shown.)

Expression of GFP-ΔN-Plakoglobin downregulated endogenous β-catenin
levels, as well as expression of endogenous plakoglobin and VE-cadherin. (See, e.g., Figure
5.) β-catenin was shown to be required to maintain αvβ3 expression. (Data not shown.)

In addition, GFP-ΔN-Plakoglobin expression upregulated surface levels of the
laminin receptor, α6β4. (See, e.g., Figure 6.)

Example 2: Identification of HoxB2 angiogenesis regulatory protein.

Using the functional assay described above, expression of a HoxB2-GFP
fusion protein (i.e., ΔN-HoxB2) was found to decrease levels of the cell surface marker αvβ3.
Cells transfected with the ΔN-HoxB2-GFP fusion protein construct exhibit lower αvβ3
expression levels than control cells. The HoxB2 functional screen hit is an N-terminal
deletion that contains the HoxB2 homeodomain. (See, e.g., Figure 7.)

ΔN-HoxB2 downregulated both αv and β3 integrin subunit surface expression,
as well as αvβ5 surface expression. (See, e.g., Figure 8.) ΔN-HoxB2 also downregulated β3
transcript expression. (Data not shown.) ΔN-HoxB2 inhibited proliferation in HUVECs, a
primary cell line. (See, e.g., Figure 9.) In addition, full length HoxB2 phenocopied
ΔN-HoxB2 and downregulated surface expression of αvβ3. (See, e.g., Figure 10.)

Cells expressing full length HoxB2 or ΔN-HoxB2 were isolated by cell sorting
and assayed for tube formation using the co-culture assay described above. Both full length
HoxB2 and ΔN-HoxB2 increased branch formation in the co-culture assay. (Data not
shown.) αvβ3 expression was directly downregulated using β3 siRNA. Downregulation of
αvβ3 expression did not affect branch formation in the co-culture assay.

Example 3: Identification of SUSP-1 angiogenesis regulatory protein. 

Using the functional assay described above, expression of a SUSP-1 GFP
fusion protein (i.e., GFP-SUSP-1) was found to decrease levels of the cell surface marker
αvβ3. (See, e.g., Figure 11.) Cells transfected with the GFP-SUSP-1 fusion protein construct
exhibit lower αvβ3 expression levels than control cells. The SUSP-1 functional screen hit is
an N-terminal fusion in the sense orientation that has an additional encoded cDNA insert
before the wild-type start codon. (See, e.g., Figure 11.)

The GFP-SUSP-1 screening hit did not affect proliferation when expressed in
HUVEC (human umbilical vein endothelial cells), PASMC (pulmonary artery smooth muscle
cells), and NHDF (normal human dermal fibroblasts). (See, e.g., Figure 12.) SUSP-1 mRNA
was found to be expressed in a variety of cell lines. (See, e.g., Figure 13.)

GFP-SUSP-1 expression inhibited tube formation in the co-culture assay
described above. (Data not shown.) Two SUSP-1 siRNAs were synthesized and found to
inhibit SUSP-1 expression. (See, e.g., Figure 14.) The SUSP-1 siRNAs also inhibited tube
formation in the co-culture assay. (Data not shown.)

Example 4: Identification of Δ5FADS angiogenesis regulatory protein.

Using the functional assay described above, expression of a Δ5FADS
antisense nucleic acid (i.e., GFP-Δ5FADS) was found to decrease levels of the cell surface
marker αvβ3. (See, e.g., Figure 30.) Cells transfected with the GFP-Δ5FADS construct
exhibit lower αvβ3 expression levels than control cells. The Δ5FADS functional screen hit is
a 300 bp antisense nucleic acid. (See, e.g., Figure 30.)

The GFP-Δ5FADS screening hit did not affect proliferation when expressed in
HUVEC (human umbilical vein endothelial cells), PASMC (pulmonary artery smooth muscle
cells), and NHDF (normal human dermal fibroblasts). (See, e.g., Figure 31.) Δ5FADS
mRNA was found to be expressed in a variety of cell lines, and appeared to be highly
expressed in endothelial cells, hepatocytes, and mammary epithelial cells. (Data not shown.)

Antisense Δ5FADS nucleic acids inhibited tube formation in the co-culture
assay described above. (Data not shown.) Two Δ5FADS siRNAs were synthesized and
found to inhibit Δ5FADS expression. (See, e.g., Figure 32.) The Δ5FADS siRNAs inhibited
tube formation in the co-culture assay. (Data not shown.) The Δ5FADS siRNAs inhibited
vitronectin haptotaxis and reduced SDF-1 induced chemotaxis in Jurkat cells. (See, e.g.,
Figures 33-34.)

Example 5: Identification of Δ6FADS angiogenesis regulatory protein.

Using the functional assay described above, expression of a KIAA1583
antisense nucleic acid (i.e., GFP-KIAA1583) was found to decrease levels of the cell surface
marker αvβ3. (Data not shown.) KIAA1583 and Δ6FADS genomic sequences are both
located on chromosome 11q12, and transcripts of the two genes are spliced together. (See, e.g.,
Figure 35.) Δ6FADS mRNA is reduced in cells expressing GFP-KIAA1583, the antisense
screening hit. (See, e.g., Figure 36.) Δ6FADS also interacts with CI phosphatase, another
αvβ3 screening hit. (Data not shown.) Δ6FADS is highly expressed in endothelial cells,
hepatocytes and skeletal muscle cells. (Data not shown.)

Δ6FADS siRNAs were synthesized and found to inhibit Δ5FADS expression.
(See, e.g., Figure 37.) The Δ6FADS siRNAs inhibited tube formation in the co-culture assay
described above. (Data not shown.) Expression of the Δ6FADS siRNAs also downregulated
αvβ3 surface expression levels. (See, e.g., Figure 37.)

All publications and patent applications cited in this specification are herein
incorporated by reference as if each individual publication or patent application were
specifically and individually indicated to be incorporated by reference.

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be readily apparent to 
one of ordinary skill in the art in light of the teachings of this invention that certain changes 
and modifications may be made thereto without departing from the spirit or scope of the 
appended claims.

1. A method for identifying a compound that modulates angiogenesis,
the method comprising the steps of:
(i) contacting the compound with a nucleic acid, or a polypeptide or a
fragment thereof encoded by a nucleic acid, wherein the nucleic acid hybridizes under
stringent conditions to a second nucleic acid comprising a nucleotide sequence selected
from the group consisting of SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID
NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID
NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID
NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID NO:169, SEQ ID
NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID NO:202, SEQ ID
NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID
NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID
NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID
NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID NO:351, SEQ ID
NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID NO:408, SEQ ID
NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID NO:439, SEQ ID
NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID NO:484, SEQ ID
NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID NO:521, and SEQ
ID NO:523; and
(ii) determining the functional effect of the compound upon the nucleic
acid or polypeptide.

2. The method of claim 1, wherein the functional effect is determined
in vitro.

3. The method of claim 2, wherein the functional effect is a physical
effect.

4. The method of claim 2, wherein the functional effect is determined
by measuring ligand or substrate binding to the polypeptide.

5. The method of claim 2, wherein the functional effect is a chemical
effect.

6. The method of claim 2, wherein the functional effect is determined
by measuring an enzymatic activity.

7. The method of claim 1, wherein the polypeptide is expressed in a
eukaryotic host cell.

8. The method of claim 7, wherein the functional effect is a physical
effect.

9. The method of claim 8, wherein the functional effect is determined
by measuring ligand or substrate binding to the polypeptide.

10. The method of claim 7, wherein the functional effect is a chemical
or phenotypic effect.

11. The method of claim 10, wherein the functional effect is
determined by measuring an enzymatic activity.

12. The method of claim 7, wherein the host cell is an endothelial cell.

13. The method of claim 12, wherein the functional effect is
determined by measuring αvβ3 expression.

14. The method of claim 12, wherein the functional effect is
determined by measuring a member selected from αvβ3 expression, haptotaxis, and
chemotaxis.

15. The method of claim 14, wherein the functional effect is
determined by measuring αvβ3 expression.

16. The method of claim 1, wherein modulation is inhibition of
angiogenesis.

17. The method of claim 1, wherein the polypeptide is recombinant.

18. The method of claim 1, wherein the nucleic acid comprises a
nucleotide sequence selected from the group consisting of SEQ ID NO:3, SEQ ID NO:32,
SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ ID NO:70, SEQ
ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID
NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID
NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID
NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID
NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID
NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID
NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID
NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID
NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID
NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID
NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID
NO:521, and SEQ ID NO:523.

19. The method of claim 1, wherein the polypeptide comprises a
sequence selected from the group consisting of SEQ ID NO:4, SEQ ID NO:33, SEQ ID
NO:44, SEQ ID NO:58, SEQ ID NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID
NO:77, SEQ ID NO:82, SEQ ID NO:87, SEQ ID NO:90, SEQ ID NO:121, SEQ ID
NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ ID NO:164, SEQ ID NO:170, SEQ ID
NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID NO:184, SEQ ID NO:203, SEQ ID
NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID NO:319, SEQ ID NO:325, SEQ ID
NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID NO:366, SEQ ID NO:378, SEQ ID
NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID NO:420, SEQ ID NO:429, SEQ ID
NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:457, SEQ ID NO:463, SEQ ID
NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID NO:497, SEQ ID NO:499, SEQ ID
NO:520, SEQ ID NO:522, and SEQ ID NO:524.

20. The method of claim 1, wherein the compound is an antibody.

21. The method of claim 1, wherein the compound is an antisense
molecule.

22. The method of claim 1, wherein the compound is a small organic
molecule.

23. The method of claim 1, wherein the compound is a peptide.

24. A method for identifying a compound that modulates angiogenesis,
the method comprising the steps of:

(i) contacting the compound with a nucleic acid, or a polypeptide or a
fragment thereof encoded by a nucleic acid, wherein the nucleic acid hybridizes under
stringent conditions to a second nucleic acid comprising a nucleotide sequence selected
from the group consisting of SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID
NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID
NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID
NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID NO:169, SEQ ID
NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID NO:202, SEQ ID
NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID
NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID
NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID
NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID NO:351, SEQ ID
NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID NO:408, SEQ ID
NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID NO:439, SEQ ID
NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID NO:484, SEQ ID
NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID NO:521, and SEQ
ID NO:523;

(ii) determining the functional effect of the compound upon the nucleic
acid or polypeptide; and

(iii) expressing the nucleic acid or polypeptide in a cell, contacting the
nucleic acid or polypeptide with the compound, and determining the phenotypic or
chemical effect upon the cell.

25. A method of modulating angiogenesis in a subject, the method
comprising the step of administering to the subject a therapeutically effective amount of a
compound identified using the method of claim 1.

26. The method of claim 25, wherein the subject is a human.

27. The method of claim 25, wherein the compound is an antibody.

28. The method of claim 25, wherein the compound is an antisense
molecule.

29. The method of claim 25, wherein the compound is a small organic
molecule.

30. The method of claim 25, wherein the compound is a peptide.

31. The method of claim 25, wherein the compound is an RNAi
molecule.

32. The method of claim 25, wherein the compound inhibits
angiogenesis.



33. A method of modulating angiogenesis in a subject, the method 
comprising the step of administering to the subject a therapeutically effective amount of a 
polypeptide, the polypeptide encoded by a nucleic acid that hybridizes under stringent 
conditions to a second nucleic acid comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO:63, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, 
SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID NO:165, SEQ ID NO:183,
SEQ ID NO:202, SEQ ID NO:218, SEQ ID NO:232, SEQ ID NO:274, SEQ ID NO:285,
SEQ ID NO:286, SEQ ID NO:297, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320,
SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:340, SEQ ID NO:377, SEQ ID NO:384,
SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456,
SEQ ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, and SEQ ID 
NO:498. 

34. A method of modulating angiogenesis in a subject, the method 
comprising the step of administering to the subject a therapeutically effective amount of a 
nucleic acid that hybridizes under stringent conditions to a second nucleic acid 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NO:3, 
SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ 
ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID 
NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID
NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID
NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID
NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID
NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID
NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID
NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID
NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID
NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID
NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID
NO:519, SEQ ID NO:521, and SEQ ID NO:523.

35. An isolated nucleic acid, wherein the nucleic acid hybridizes
under stringent conditions to a polynucleotide comprising a nucleotide sequence selected
from the group consisting of SEQ ID NO:165, SEQ ID NO:202, SEQ ID NO:210, SEQ
ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ
ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ
ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ
ID NO:329, and SEQ ID NO:330.

36. The isolated nucleic acid of claim 35, wherein the nucleic acid
comprises a nucleotide sequence selected from the group consisting of SEQ ID NO:165,
SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232,
SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297,
SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320,
SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, and SEQ ID NO:330.

37. An isolated polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO:287, SEQ ID NO:298, SEQ ID
NO:309, SEQ ID NO:319, SEQ ID NO:325, and SEQ ID NO:331.
Figure 15

A new sequence determination for the ABC transporter gene, SEQ ID NO:2. 
ABC transporter 

GGACGCGCCTGGTGCCCCGGGGAGGGGCGCCACCGGGGGAGGAGGAGGAG 

GAGAAGGTGGAGAGGAAGAGACGCCCCCTCTGCCCGAGACCTCTCAAGGC 

CCTGACCTCAGGGGCCAGGGCACTGACAGGACAGGAGAGCCAAGTTCCTC 

CACTTGGGCTGCCCGAAGAGGCCGCGACCCTGGAGGGCCCTGAGCCCACC 

GCACCAGGGGCCCCAGCACCACCCCGGGGGCCTAAAGCGACAGTCTCAGG 

GGCCATCGCAAGGTTTCCAGTTGCCTAGACAACAGGCCCAGGGTCAGAGC 

AACAATCCTTCCAGCCACCTGCCTCAACTGCTGCCCCAGGCACCAGCCCC 

AGTCCCTACGCGGCAGCCAGCCCAGGTGACATGCCGGTGCTCTCCAGGCC 

CCGGCCCTGGCGGGGGAACACGCTGAAGCGCACGGCCGTGCTCCTGGCCC 

TCGCGGCCTATGGAGCCCACAAAGTCTACCCCTTGGTGCGCCAGTGCCTG 

GCCCCGGCCAGGGGTCTTCAGGCGCCCGCCGGGGAGCCCACGCAGGAGGC 

CTCCGGGGTCGCGGCGGCCAAAGCTGGCATGAACCGGGTATTCCTGCAGC 

GGCTCCTGTGGCTCCTGCGGCTGCTGTTCCCCCGGGTCCTGTGCCGGGAG 

ACGGGGCTGCTGGCCCTGCACTCGGCCGCCTTGGTGAGCCGCACCTTCCT 

GTCGGTGTATGTGGCCCGCCTGGACGGAAGGCTGGCCCGCTGCATCGTCC 

GCAAGGACCCGCGGGCTTTTGGCTGGCAGCTGCTGCAGTGGCTCCTCATC 

GCCCTCCCTGCTACCTTCGTCAACAGTGCCATCCGTTACCTGGAGGGCCA 

ACTGGCCCTGTCGTTCCGCAGCCGTCTGGTGGCCCACGCCTACCGCCTCT 

ACTTCTCCCAGCAGACCTACTACCGGGTCAGCAACATGGACGGGCGGCTT 

CGCAACCCTGACCAGTCTCTGACGGAGGACGTGGTGGCCTTTGCGGCCTC 

TGTGGCCCACCTCTACTCCAACCTGACCAAGCCACTCCTGGACGTGGCTG 

TGACTTCCTACACCCTGCTTCGGGCGGCCCGCTCCCGTGGAGCCGGCACA 

GCCTGGCCCTCGGCCATCGCCGGCCTCGTGGTGTTCCTCACGGCCAACGT 

GCTGCGGGCCTTCTCGCCCAAGTTCGGGGAGCTGGTGGCAGAGGAGGCGC 

GGCGGAAGGGGGAGCTGCGCTACATGCACTCGCGTGTGGTGGCCAACTCG 

GAGGAGATCGCCTTCTATGGGGGCCATGAGGTGGAGCTGGCCCTGCTACA 

GCGCTCCTACCAGGACCTGGCCTCGCAGATCAACCTCATCCTTCTGGAAC 

GCCTGTGGTATGTTATGCTGGAGCAGTTCCTCATGAAGTATGTGTGGAGC 

GCCTCGGGCCTGCTCATGGTGGCTGTCCCCATCATCACTGCCACTGGCTA 

CTCAGAGTCAGATGCAGAGGCCGTGAAGAAGGCAGCCTTGGAAAAGAAGG 

AGGAGGAGCTGGTGAGCGAGCGCACAGAAGCCTTCACTATTGCCCGCAAC 

CTCCTGACAGCGGCTGCAGATGCCATTGAGCGGATCATGTCGTCGTACAA 

GGAGGTGACGGAGCTGGCTGGCTACACAGCCCGGGTGCACGAGATGTTCC 

AGGTATTTGAAGATGTTCAGCGCTGTCACTTCAAGAGGCCCAGGGAGCTA 

GAGGACGCTCAGGCGGGGTCTGGGACCATAGGCCGGTCTGGTGTCCGTGT 

GGAGGGCCCCCTGAAGATCCGAGGCCAGGTGGTGGATGTGGAACAGGGGA 

TCATCTGCGAGAACATCCCCATCGTCACGCCCTCAGGAGAGGTGGTGGTG 

GCCAGCCTCAACATCAGGGTGGAGGAAGGCATGCATCTGCTCATCACAGG 

CCCCAATGGCTGCGGCAAGAGCTCCCTGTTCCGGATCCTGGGTGGGCTCT 

GGCCCACGTACGGTGGTGTGCTCTACAAGCCCCCACCCCAGCGCATGTTC 

TACATCCCGCAGAGGCCCTACATGTCTGTGGGCTCCCTGCGTGACCAGGT 

GATCTACCCGGACTCAGTGGAGGACATGCAAAGGAAGGGCTACTCGGAGC 

AGGACCTGGAAGCCATCCTGGACGTCGTGCACCTGCACCACATCCTGCAG 

CGGGAGGGAGGTTGGGAGGCTATGTGTGACTGGAAGGACGTCCTGTCGGG 

TGGCGAGAAGCAGAGAATCGGCATGGCCCGCATGTTCTACCACAGGCCCA 

AGTACGCCCTCCTGGATGAATGCACCAGCGCCGTGAGCATCGACGTGGAA 

GGCAAGATCTTCCAGGCGGCCAAGGACGCGGGCATTGCCCTGCTCTCCAT 

CACCCACCGGCCCTCCCTGTGGAAATACCACACACACTTGCTACAGTTCG 

ATGGGGAGGGCGGCTGGAAGTTCGAGAAGCTGGACTCAGCTGCCCGCCTG 

AGCCTGACGGAGGAGAAGCAGCGGCTGGAGCAGCAGCTGGCGGGCATTCC 

CAAGATGCAGCGGCGCCTCCAGGAGCTCTGCCAGATCCTGGGCGAGGCCG 
TGGCCCCAGCGCATGTGCCGGCACCTAGCCCGCAAGGCCCTGGTGGCCTC
CAGGGTGCCTCCACCTGACACAACCGTCCCCGGCCCCTGCCCCGCCCCCA 
AGCTCGGATCACATGAAGGAGACAGCAGCACCCACCCATGCACGCACCCC 
GCCCCTGCATGCCTGGCCCCTCCTCCTAGAAAACCCTTCCCGCCCTCGGG 
AAAGTAGATGTGGAGGGTGGCGCCCTGCGTAACCCTCGCCCTGTCCCTCC 
CACTCCCTGGGGGCGCTGTTCCACAGTGACTGGGCCCTGTCCAGGGCAGT 
GAGTCCTCTACTTTGCTCCGTGGAGGAAGCTGGGGTACAAGGGGCCCAGT 
GCTGGCCACACAGCAGCGCAGCCGAGCCCCAGGAGCCCGTCAGGCCACAG 
CCCCTGGCACTGCAGGTGGCCTCCCTCCAGAGACTCGAGTCCCCATGATT 
CCCTCCTCGTCAGTCTCTCAAAGACCCCATGGTCCATCCCCTGAGGGTGG 
TCAGCCAAGGCTCCCGTTCCGTGGGATGCCATAAAAGCCGCCCAGTGGGA 
CCCACAGTCACACAGAGCGCCTCACCTGCATCCTCTCCCCCACAAGAGCC 
CCAAAGATCCCACGGGAGAGGGGAGAGGGACGCACAGCACTGCCTGCCAA 
GCGAGAATGCAGGCCCCGCCCCCTCGGCCCCTCACCACCTCTTTCTACAG 
CCTAATTTATTGGATTCCCTATTCGTAGCCATCTCCGTGGCCAATGTGAC 
TACCGTGCCAGCAGCGGGGGCGGCCCAGCCTCTGAGTCCCGTGGGGCCCC 
GGCTCCCACCGGTGCCAAACCCAGCCCCTGCGGCCGTCACCCCGCCAGCC 
TACACTGCCAGCCGCCACCGGGGCACACGGGCCTCTGCTTGCCAGCCAGG 
AGTGCGGAC^CCATGTTCCCAGCrCAGTGCCAAAGAGGGGTCACCAGGGG 
GAGCTGTCTGCGGAGCCAGCGCCTGCCCGAGAGAGACCCCACCGCCACCG 
TGTGCCTTTCCCGGGCCCTCAGCCCTCGGGCCGGGCACCACCCCCAGTCC 
CCCCAGTAAAAGCCTCCACTGGCAAAAAAAAAAAAAAAAAAAA 
A new sequence determination, SEQ ID NO:127, for HSPA5
SEQ ID NO:127

HSPA5>

ATGGAGGAGGACAAGAAGGAGGACGTGGGCACGGTGGTCGGCATCGACCT 

GGGGACCACCTACTCCTGCGTCGGCGTGTTCAAGAACGGCCGCGTGGAGA 

TCATCGCCAACGATCAGGGCAACCGCATCACGCCGTCCTATGTCGCCTTC 

ACTCCTGAAGGGGAACGTCTGATTGGCGATGCCGCCAAGAACCAGCTCAC 

CTCCAACCCCGAGAACACGGTCTTTGACGCCAAGCGGCTCATCGGCCGCA 

CGTGGAATGACCCGTCTGTGCAGCAGGACATCAAGTTCTTGCCGTTCAAG 

GTGGTTGAAAAGAAAACTAAACCATACATTCAAGTTGATATTGGAGGTGG 

GC7VAACAAAGACATTTGOTCCTGAAGAAATTTCTGCCATGGTTCTCACTA 

AAATGAAAGAAACCGCTGAGGCTTATTTGGGAAAGAAGGTTACCCATGCA 

GTTGTTACTGTACCAGCCTATTTTAATGATGCCCAACGCCAAGCAACCAA 

AGACGCTGGAACTATTGCTGGCCTAAATGTTATGAGGATCATCAACGAGC 

CTACGGCAGCTGCTATTGCTTATGGCCTGGATAAGAGGGAGGGGGAGAAG 

AACATCCTGGTGTTTGACCTGGGTGGCGGAACCTTCGATGTGTCTCTTCT 

CACCATTGACAATGGTGTCTTCGAAGTTGTGGCCACTAATGGAGATACTC 

ATCTGGGTGGAGAAGACTTTGACCAGCGTGTCATGGAACACTTCATCAAA 

CTGTACAAAAAGAAGACGGGCAAAGATGTCAGGAAAGACAATAGAGCTGT 

GCAGAAACTCCGGCGCGAGGTAGAAAAGGCCAAACGGGCCCTGTCTTCTC 

AGCATCAAGCAAGAATTGAAATTGAGTCCTTCTATGAAGGAGAAGACTTT 

TCTGAGACCCTGACTCGGGCCAAATTTGAAGAGCTCAACATGGATCTGTT 

CCGGTCTACTATGAAGCCCGTCCAGAAAGTGTTGGAAGATTCTGATTTGA 

AGAAGTCTGATATTGATGAAATTGTTCTTGTTGGTGGCTCGACTCGAATT 

CCAAAGATTCAGCAACTGGTTAAAGAGTTCTTCAATGGCAAGGAACCATC 

CCGTGGCATAAACCCAGATGAAGCTGTAGCGTATGGTGCTGCTGTCCAGG 

CTGGTGTGCTCTCTGGTGATCAAGATACAGGTGACCTGGTACTGCTTGAT 

GTATGTCCCCTTACACTTGGTATTGAAACTGTGGGAGGTGTCATGACCAA 

ACTGATTCCAAGGAACACAGTGGTGCCTACCAAGAAGTCTCAGATCTTTT 

CTACAGCTTCTGATAATCAACCAACTGTTACAATCAAGGTCTATGAAGGT 

GAT^AGACCCCTGACAAAAGACAATCATCTTCTGGGTACATTTGATCTGAC 

TGGAATTCCTCCTGCTCCTCGTGGGGTCCCACAGATTGAAGTCACCTTTG 

AGATAGATGTGAATGGTATTCTTCGAGTGACAGCTGAAGACAAGGGTACA 

GGGAACAAAAATAAGATCACAATCACCAATGACCAGAATCGCCTGACACC 

TGAAGAAATCGAAAGGATGGTTAATGATGCTGAGAAGTTTGCTGAGGAAG 

ACAAAAAGCTCAAGGAGCGCATTGATACTAGAAATGAGTTGGAAAGCTAT 

GCCTATTCTCTAAAGAATCAGATTGGAGATAAAGAAAAGCTGGGAGGTAA 

ACTTTCCTCTGAAGATAAGGAGACCATGGAAAAAGCTGTAGAAGAAAAGA 

TTGAATGGCTGGAAAGCCACCAAGATGCTGACATTGAAGACTTCAAAGCT 

AAGAAGAAGGAACTGGAAGAAATTGTTCAACCAATTATCAGCAAACTCTA 

TGGAAGTGCAGGCCCTCCCCCAACTGGTGAAGAGGATACAGCAGAACTCC 

ACCACCACCACCACCAC 
Protein disulfide isomerase, SEQ ID NO:165.
SEQ ID NO:165

Novel protein disulfide isomerase
Novel (disulfide isomerase-like)
CGGACCAACACAGTATTGAGTCAACTGTGACCTTAAG 

GAACTCGTGGGCAGGTAAACTATGCTTTGGATGTGCTTTCTTTCACCAAAATCACTCAACTCAGGAGCCACAAATAG 
TCCAGCAATTTCATTTCCCTCAACGCTATTTTAGTCTC^ 

GGGATATATCGCCACTGAAAATGTTTACACAGTGACCATGAGTTACACATTTACTTAGAGAAACTTAACTTAATAAA 
GAATCTGTAGAGTGTGTTGGCTTGGAAAACACACACACAAAGAAGATACCTCACGCTTAGTATGTTCTGCTTTCTGA 
ACAGCCACCACTGGGAACCCAGTGGCCTCTGTGGGACTGAACTCCTAAACGCAGGGTGCGGGAGCTGGGCAGGAGAG 
GTGACCTCCAACTGTGTTCCTAAAGTTCGTCTTTCGCTTGGCTCAGGACAAAGCGGTGTAACGAGTCAAAGTCTCTG 
CCTCCACTGTGCTCACTGACTTTCTTCCCTCCTCGGAAAAGCAATAACGTGGGGTAGCCTCGT 
Figure 18

Chromosome 1 protein
SEQ ID NO:210

Clone 30 -Novel (Chromosome 1) > 

ATCTACATGAGGTCCTGTAGATTGAGCAACTTTGGCTATT 

CCACCTGTTACTTTGACAACTCTAGCTTCAG^ 

GTCTCCAGAGGCATAAGCTCCAACCCCCAGAGAGCTCCAC^GCATTCTTCTCCTTCTAGTAGTACTTTTGGTTTCCA 
GG AAAATCATCTCATGCTCGGGCCACAGTCACAGTCCTGACCAGCACCATAGAAATTTCAGCCATTATTCTAGTTGT 
ATGGAGTCATTCAAAATACTAGTTCCGAAAGAAGTTTCA^GGTCCAACGCCGAGCATCTTTTACCTT 
TTCCATAAATTGAAAGCCTT 
Figure 19
Chromosome 3 protein
SEQ ID NO:218

Clone 95 -Novel (Chromosome 3/ H41) > 

GCGTCGCTCGGCGTTAGCCAAGGCCCGGGCGGCGCCACCCTCCGGGGGCACTAGGTCTGGGGCCGCAGTGCCCAGCA 
CAGAGCAGCGTTTATCGGGACGAAGATGAATGGAAAGAATTGGAGCAAAAAGAGGTTGATTACAGCGGCCTCAGGGG 
TTCAGGCAATGCAAATAAGCAGTGAAAAGGAAGAAGACGATAATGAAAAG 
Chromosome 17

SEQ ID NO:227
Clone 147 - Novel (Chromosome 17) >

CTGCTTCTCTTCTAGGATAGTTTCCCTCTAGAAATCCATGT 

GCCTCCTTAGCTAAGAGGCCTTCCTTATCTACCCTAACTAAGAGAATCCCCATATCACTCTTATCCTTTTATCCTGA 
9 TTTATTTTTCCATAAAACTTGCCACATTACTC^ 

CTTCCCTACTATAATATGAGCTCCATGAGGCAGAGACTTTGTTTTGTTCACTGCTGAATTCCCAGCACCTACAACAA 

GCCTACTCACTAAACATTTGTTGAATTAATCTTCAGGATACAT 

GTGATGCATAGGAAAATGTCTGGAAGGGTACATACCAAAATGTTA^ 

TCTTTCCTTTTCTTTTCTTTTTGAGACAGAGTTTCGCTCTCGTTGCCCAGGCTGGAGTGGAATGGCAAG 
Figure 21

Chromosome 8 protein 

SEQ ID NO:232 

Novel (chromosome 8)> 

GCTTGCAAAGGAGAGGCTGTGACTACCAAGTCGTGTCAACAACTGAATGGCTGAAATACCCAAACTTGCCCATGCAA 
ATGGGCTTGGGTCTCTCCTGGCAGCCGCCTTTGAAGGCTCTAGACTTATCTGTGAACTCCTTTTTTGAGAGGGTCTT 
TCCAACTAGTGGTTTATTCTTTGACTCTCCTCATACCTTTTTTGCCAGAGAGTGAGAGTGAGAAGGGAGGGCTAATG 
CCTGAGCTCCTGCCCTTTCTATGCAGTGAGGGTCAAGATCCTCAGCTAGTGTTTGAGGGAACTGGTGTAACCTGGGT 
CTCTCATTTTCTACCATCCAAGTTGCC 
Figure 22

Chromosome 9 protein 
SEQ ID NO:248 

cSa^ 

CTCCGCCAGGCCCCCAGCGCACAACCATCTGGGATCCCCAGGAAAAGAGCTGCGTGCGGCCAG 



DT15 Rec'd PCT/PTO 23 FEB 2005



WO 2004/019893 




PCT/US2003/027523 



Figure 23 
1/226 protein 
SEQ ID NO:274 

Clone 1/226 - Novel (Clone 1/226) > 

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCTTC 
TTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCC 

CAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCAC^CTTCACTTCATCCCTTCTACCCA 

TCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTT 

AACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATATAAAAATATAG 
ACTTTTAAAAAAAAAAAAAAAAAAAAA 
Figure 24

A new sequence determination for FLJ10688
SEQ ID NO:285 

Novel (FLJ10688) > 

CACCCAGCACCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAGAAG 
CGACACTTGAAGTTAGCACGCGTTGAGAGTCAAAATGGCAT^ 

CTGTGCCACTGTTGATAGCTACC7VTCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAATGGAAAA 
AGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGCAAATACTAC 
TTTG 
Figure 25

A new sequence determination for KIAA1583 
SEQ ID NO:307 

Clone 89 -Novel (KIAA1583)> 

ACTGCAGGTGGCAGCCACGGGCGCGCCGCTCGGCCTCATCTGACGCCTCTGCAGCGGGTTCCGCAGGCCTGCAGGGC 
GGGGAGGCCGGGACTGGCCGTCAGCGCTGAACGGCCCAGCCTGCCCAGGGCCCAGCTGCTGGAGACCCGCAGCTCGT 
CCCCGGCGGCTCCTAATCACCAGCAGCTCCTGTTTCTCAAAACGCAGACATCCGCCCCTCTTGGGGTCAGGCCCTTC 
CACCCTGCAGGCGAGCCGCCCCAGCCCACTCCCGACTGGCGCTATGCCTCGATCACCGCTCTTGCTCCCAAGTGGAC 
CGCAGGGGAGACGCTCTCTTACGGGGACCCTGGGGGCGCTCACTCTCTGAAGGGCCTGGAAGCTAGATTCCAGAGGC 
GTGGGCCACCTCTCCCTGGGTTTTGGGGAGCCCCCTCCGAGGGTGTTCATTTCCTGAGCTCTGTGTCATCTTAGGCT 
CTGAGGGT 
Figure 26

A new sequence determination for KIAA1814
SEQ ID NO:317

Clone 75 -Novel (KIAA1814)> 

CTGGACGGCCTGGCTGGGCTGAAGGGCGAGGACAGCCGCAGCAAGGAGGCAGGGGAGGGCGGCCTACCGCTGTGCGG 
GCCCACGGACAAGACCCCACTGCTGAGCGGCAAGGC 

ACAACCTCTTCATCTCTGCGGCGGCCGTGCCTCCCGGAAGCCTCCTCAGCGGCCCGGCCTGGCCCCGGCGGCGTCCT 
CCGCAGGCGGCGCGGCGTCCTCCGCCCAGACGCACCGGTCCTTCCTGGGCCCCTTCCCGCCGGGACCGCAGTTCGCG 
CTCGGCCCCATGTCCCTGCAGGCCAACCTCGGCTCCGTGGCCGGCTCCTCCGTGCTGC71GTCGCTGTTCAGCTCTGT 
GCCGGCCGCCGCAGGCCTGGTGCACGTGTCGTCCGCTGCCACCAGACTGACCAACTCGCACGCCATGGGCAG 
Figure 27

Chromosome 4 protein
SEQ ID NO:320

Novel (maps to chromosome 4) 
Novel (Chromosome 4)> 

GGAATAAAAGAGTGGAAATGGGGATTTCCAGGTGCTCCCCTGGTT^ 

CTATCATGAATCTCCTTGGAATGCTCATTITTAGT 

GTATCTCAAGAAAATATTTTATGATTAGTAAGCTTAT^ 

TAATAATTTTTTATTTAATAATTAGAT^ 

GTTCAGATTTGAGAATAAACCTTGGGGTATCATra 

AGATTCTTGAATTATATATCTTACAGACTAGTCCCCAAAAGAATACGTGTTTCCTTACCTTTAATTTCTCATGGTAG 
TTAGTCTGTGAAT 
Figure 28



A new sequence determination for peroxidasin/melanoma antigen related protein 
SEQ ID NO:323 

Novel peroxidasin-like/ melanoma antigen> 

CGACCTGGCCAGCCACCGCGGCCTGCTGCGGCAGGGCATCGTGCAGCGGTCCGGGAAGCCGCTGCTCCCCTTCGCCA 
CCGGGCCGCCCACGGAGTGCATGCGGGACGAGAACGAGAGCCCCATCCCCTGCTTCCTGGCCGGGGACCACCGCGCC 
AACGAGCAGCTGGGCCTGACCAGCATGCACACGCTGTGGTTCCGCGAGCACAACCGCATTGCCACGGAGCTGCTCAA 
GCTGAACCCGCACTGGGACGGCGACACCATCTACTATGAGACCAGGAAGATCGTGGGTGCGGAGATCCAGCACATCA 
CCTACCAGCACTGGCTCCCGAAGATCCTGGGGGAGGTGGGCATGAGGACGCTGGGAGAGTACCACGGCTACGACCCC 
GGCATCAATGCTGGCATCTTCAACGCCTTC^ 

TTACCGGCTGGACGAGAACTTCCAGCCCATTGCACAAGATCACCTCCCCCTTCACAAAGCTTTCTTCTCTCCCTTCC 
GGATTGTGAATGAGGGCGGCATCGATCCGCTTCTCAGGGGGCTGTTCGGGGTGGCGGGGAAAATGCGTGTGCCCTCG 
CAGCTGCTGAACACGGAGCTCACGGAGCGGCTGTTCTCCATGGCACACACGGTGGCTCTGGACCTGGCGGCCATCAA 
CATCCAGCGGGGCCGGGACCACGGGATCCCACCCTACCACGACTACAGGGTCTACTGCAATCTATCGGCGGCACACA 
CGTTCGAGGACCTGAAAAATGAGATTAAAAACCCTGAGATCCGG 
Figure 29

WD40/SOCS box protein
SEQ ID NO:329

Novel WD40/SOCS box protein>

CGGCTCTGTTGCCGTGCAAGTGGAATCATCTTACCTCTAAGACATGACTTTCACATGCCAGGGGAGAGAAAGATCCA 

TGACTAGTACACTGGCAATCTGGTTTTGCTACATTCTATTCACAATCCCAAAGAAATGCTATTTCAATGCAAGACCA 

GATGTTTGGCCCATTATTCCAGCAACTCCCTTTGACAGGACGATTTACCCTGCTACCAAAGAAGCACAAGATGTGGT 

GTTGCTTAAAAAAGTCCTGTATGTGAGGAACTCTTTCATTTTCTTGGGGATTGGCAGTGCTAGGACTTGGTAAGTTG 

TTAGGAAACTTCGAAGGGCTTTCCGGCATAAGTGCTTCAGTGAGGACAGGACCCTAGGAGCTGTCCAGAACTGGACG 

TGGCCATCTCTTGTCCCTGTGGCAATGACTCCACCATGTGGAAAAAATGTGCAGCAAAGCCCATTGGTCATAGGAGC 

AAATGCAATGGGAGTTTTCAGTTCCAGGGCCCAGATCCTGAGGAGTCTGTCATCTGCCACCGTGGCAAGGTACAAGC 

CCCCG 
Figure 32

BRIEF SUMMARY OF THE INVENTION

The present application identifies, for the first time, a number of proteins and 
DNA molecules involved in regulation of angiogenesis, e.g., angiogenesis regulatory proteins 
and DNA molecules. The invention further relates to methods for identifying and using 
agents, including small organic molecules, antibodies, peptides, cyclic peptides, nucleic
acids, antisense nucleic acids, RNAi, and ribozymes, that modulate angiogenesis via 
modulation of angiogenesis regulatory proteins and DNA molecules; as well as to the use of 
expression profiles and compositions in diagnosis and therapy of diseases related to 
insufficient or increased angiogenesis. 

In one aspect, the present invention provides a method for identifying a
compound that modulates angiogenesis, the method comprising the steps of:

(i) contacting the compound with an angiogenesis regulatory nucleic acid, or
an angiogenesis regulatory polypeptide or a fragment thereof encoded by a nucleic acid,
wherein the nucleic acid hybridizes under stringent conditions to a second nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NO:3, SEQ
ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ ID NO:70,
SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID
NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID
NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID
NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID
NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID
NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID
NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID
NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID
NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID
NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID
NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID
NO:521, and SEQ ID NO:523; and (ii) determining the functional effect of the compound
upon the nucleic acid or polypeptide.

In one embodiment, the functional effect is determined in vitro. In another
embodiment, the functional effect is a physical effect. In another embodiment, the functional
effect is determined by measuring ligand or substrate binding to the polypeptide. In another
embodiment, the functional effect is a chemical effect. In another embodiment, the
functional effect is determined by measuring an enzymatic activity.

In one embodiment, the polypeptide is expressed in a eukaryotic host cell. In
a further embodiment, the functional effect is a physical effect. In another embodiment, the
functional effect is determined by measuring ligand or substrate binding to the polypeptide.
In a further embodiment, the functional effect is a chemical or phenotypic effect. In another
embodiment, the functional effect is determined by measuring an enzymatic activity. In
another embodiment, the host cell is an endothelial cell. In a further embodiment, the
functional effect is determined by measuring αvβ3 expression, haptotaxis, chemotaxis,
or co-culture tube formation.

In one embodiment, modulation is inhibition of angiogenesis. 

In one embodiment, the polypeptide is recombinant. In another embodiment,
the nucleic acid comprises a nucleotide sequence selected from the group consisting of SEQ
ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID NO:68,
SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID
NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID
NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID
NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID
NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID
NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID
NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID
NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID
NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID
NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID
NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID
NO:519, SEQ ID NO:521, and SEQ ID NO:523.

In another embodiment, the polypeptide comprises a sequence selected from
the group consisting of SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44, SEQ ID NO:58, SEQ
ID NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID NO:82, SEQ ID NO:87,
SEQ ID NO:90, SEQ ID NO:121, SEQ ID NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ
ID NO:164, SEQ ID NO:170, SEQ ID NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID
NO:184, SEQ ID NO:203, SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID
NO:319, SEQ ID NO:325, SEQ ID NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID
NO:366, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID
NO:420, SEQ ID NO:429, SEQ ID NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID
NO:457, SEQ ID NO:463, SEQ ID NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID
NO:497, SEQ ID NO:499, SEQ ID NO:520, SEQ ID NO:522, and SEQ ID NO:524.

In one embodiment, the compound is an antibody, an antisense molecule, a
small organic molecule, or a peptide.

In another aspect, the present invention provides a method for identifying a
compound that modulates angiogenesis, the method comprising the steps of (i) contacting the
compound with a nucleic acid, or a polypeptide or a fragment thereof encoded by a nucleic
acid, wherein the nucleic acid hybridizes under stringent conditions to a second nucleic acid
comprising a nucleotide sequence selected from the group consisting of SEQ ID NO:3, SEQ
ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ ID NO:70,
SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID
NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID
NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID
NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID
NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID
NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID
NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID
NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID
NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID
NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID
NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID
NO:521, and SEQ ID NO:523; (ii) determining the functional effect of the compound upon
the nucleic acid or polypeptide; and (iii) expressing the nucleic acid or polypeptide in a cell,
contacting the nucleic acid or polypeptide with the compound, and determining the
phenotypic or chemical effect upon the cell.

In another aspect, the present invention provides a method of modulating 
angiogenesis in a subject, the method comprising the step of administering to the subject a 
therapeutically effective amount of a compound identified as a modulator of angiogenesis 
using the methods described herein. In one embodiment, the subject is a human. In a further 
embodiment, the compound is an antibody, an antisense molecule, a small organic molecule,
a peptide, or an RNAi molecule. In another embodiment, the compound inhibits 
angiogenesis. 

In another aspect, the present invention provides a method of modulating 
angiogenesis in a subject, the method comprising the step of administering to the subject a 
therapeutically effective amount of a polypeptide, the polypeptide encoded by a nucleic acid 
that hybridizes under stringent conditions to a second nucleic acid comprising a nucleotide 
sequence selected from the group consisting of SEQ ID NO:63, SEQ ID NO:76, SEQ ID 
NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID 
5 NO: 165, SEQ ID NO: 183, SEQ ID NO:202, SEQ ED NO:218, SEQ ID NO:232, SEQ ID 
NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ED NO:297, SEQ ID NO:317, SEQ ED 
NO:318, SEQ ED NO:320, SEQ ED NO:323, SEQ ED NO:324, SEQ ED NO:340, SEQ ED 
NO:377, SEQ ED NO:384, SEQ ED NO:406, SEQ ED NO:408, SEQ ED NO:439, SEQ ED 
NO:445, SEQ ED NO:456, SEQ ED NO:481, SEQ ED NO:484, SEQ ED NO:493, SEQ ED 

10 NO:496, and SEQ ED NO:498. 

In another aspect, the present invention provides a method of modulating 
angiogenesis in a subject, the method comprising the step of administering to the subject a 
therapeutically effective amount of a nucleic acid that hybridizes under stringent conditions 
to a second nucleic acid comprising a nucleotide sequence selected from the group consisting 
of SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID 
NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, 
SEQ ID NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, 
SEQ ID NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, 
SEQ ID NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, 
SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, 
SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, 
SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, 
SEQ ID NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, 
SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, 
SEQ ID NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, 
SEQ ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, 
SEQ ID NO:519, SEQ ID NO:521, and SEQ ID NO:523. 

In another aspect, the present invention provides the nucleic acid sequences of 
SEQ ID NO:165, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, 
SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, 
SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, 
SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, and SEQ ID NO:330. 

In another aspect, the present invention provides the amino acid sequences of 
SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID NO:319, SEQ ID NO:325, and 
SEQ ID NO:331. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides a schematic diagram of an assay for modulators of 
angiogenesis. 

Figure 2 provides results of an experiment demonstrating the effect of 
plakoglobin-GFP fusion protein expression on levels of the cell surface marker αvβ3. 
Plakoglobin is an exemplary angiogenesis regulatory protein. Human umbilical vein 
endothelial (HUVEC) cells were transfected with a vector expressing the plakoglobin-GFP 
fusion protein ((GGS)2 = SEQ ID NO:525) or a control vector, expressing GFP only. Cells 
were incubated with APC-labeled antibodies directed against the cell surface marker αvβ3. 
The X-axis depicts cell number and the Y-axis depicts the amount of αvβ3-APC antibody 
derived fluorescence. Cells transfected with the plakoglobin-GFP fusion protein construct 
exhibit lower αvβ3 expression levels than control cells. 

Figure 3 also provides results of an experiment demonstrating the effect of 
plakoglobin-GFP fusion protein (i.e., GFP-ΔN-plakoglobin) expression on levels of the cell 
surface marker αvβ3. The figure also depicts the presence of armadillo repeats in the 
functional screen hit, GFP-ΔN-plakoglobin. 

Figure 4 demonstrates that GFP-ΔN-Plako expression downregulates 
vitronectin receptors (αvβ3 and αvβ5) but not the fibronectin receptor (α5β1). 

Figure 5 demonstrates that expression of GFP-ΔN-Plakoglobin downregulates 
endogenous catenin levels, as well as expression of endogenous plakoglobin and 
VE-cadherin. 

Figure 6 demonstrates that GFP-ΔN-Plakoglobin expression upregulates 
surface levels of the laminin receptor, α6β4. 

Figure 7 provides results of an experiment demonstrating the effect 
of expression of a HoxB2-GFP fusion protein (i.e., ΔN-HoxB2) on levels of the cell surface 
marker αvβ3. Cells transfected with the ΔN-HoxB2-GFP fusion protein construct exhibit 
lower αvβ3 expression levels than control cells. The figure also provides the structure of the 
functional screen hit, demonstrating that the homeodomain is present in ΔN-HoxB2. 

Figure 8 demonstrates that ΔN-HoxB2 downregulates both αv and β3 integrin 
subunit surface expression, as well as αvβ5 surface expression. 

Figure 9 demonstrates that ΔN-HoxB2 inhibits proliferation in HUVECs, a 
primary cell line. 

Figure 10 demonstrates that full length HoxB2 phenocopies ΔN-HoxB2 in the 
downregulation of αvβ3. 

Figure 11 provides results of an experiment demonstrating the effect 
of expression of a SUSP-1 GFP fusion protein (i.e., GFP-SUSP-1) on levels of the cell surface 
marker αvβ3. Cells transfected with the GFP-SUSP-1 fusion protein construct exhibit lower 
αvβ3 expression levels than control cells. The figure also provides the structure of the 
functional screen hit, demonstrating that GFP-SUSP-1 is an N-terminal fusion in the sense 
orientation and that a cDNA insert is present before the wild-type start codon. 

Figure 12 demonstrates that the GFP-SUSP-1 screening hit does not affect 
proliferation when expressed in HUVEC (human umbilical vein endothelial cells), PASMC 
(pulmonary artery smooth muscle cells), and NHDF (normal human dermal fibroblasts). 

Figure 13 provides expression analysis of SUSP-1 mRNA in a variety of cell 
lines. 

Figure 14 demonstrates that SUSP-1 siRNAs inhibit SUSP-1 mRNA 
expression. 

Figure 15 provides a new sequence determination for a nucleic acid encoding 
an ABC transporter protein (SEQ ID NO:2). 

Figure 16 provides a new sequence determination for a nucleic acid encoding 
an HSPA5 protein (SEQ ID NO:127). 

Figure 17 provides a new sequence determination for a nucleic acid encoding 
a protein disulfide isomerase (SEQ ID NO:165). 

Figure 18 provides a new sequence determination for a nucleic acid encoding 
a novel protein from human chromosome 1 (SEQ ID NO:210). 

Figure 19 provides a new sequence determination for a nucleic acid encoding 
a novel protein from human chromosome 3 (SEQ ID NO:218). 

Figure 20 provides a new sequence determination for a nucleic acid encoding 
a novel protein from human chromosome 17 (SEQ ID NO:227). 

Figure 21 provides a new sequence determination for a nucleic acid encoding 
a novel protein from human chromosome 8 (SEQ ID NO:232). 

Figure 22 provides a new sequence determination for a nucleic acid encoding 
a novel protein from human chromosome 9 (SEQ ID NO:248). 

Figure 23 provides a new sequence determination for a nucleic acid encoding 
a 1/226 protein (SEQ ID NO:274). 

Figure 24 provides a new sequence determination for a nucleic acid encoding 
an FLJ10688 protein (SEQ ID NO:285). 

Figure 25 provides a new sequence determination for a nucleic acid encoding 
a KIAA1583 protein (SEQ ID NO:307). 

Figure 26 provides a new sequence determination for a nucleic acid encoding 
a KIAA1814 protein (SEQ ID NO:317). 

Figure 27 provides a new sequence determination for a nucleic acid encoding 
a novel protein from human chromosome 4 (SEQ ID NO:320). 

Figure 28 provides a new sequence determination for a nucleic acid encoding 
a peroxidasin/melanoma antigen related protein (SEQ ID NO:323). 

Figure 29 provides a new sequence determination for a nucleic acid encoding 
a WD40/SOCS box protein (SEQ ID NO:329). 
Figure 30 provides results of an experiment demonstrating the effect 
of a Δ5FADS antisense screening hit (i.e., GFP-Δ5FADS or D5ADS) on levels of the cell 
surface marker αvβ3. Cells transfected with the GFP-Δ5FADS antisense construct exhibit 
lower αvβ3 expression levels than control cells. The figure also provides the structure of the 
functional screen hit, demonstrating that GFP-Δ5FADS is in the antisense orientation and 
that the antisense portion of the Δ5FADS nucleic acid is about 300 base pairs long. 

Figure 31 demonstrates that the GFP-D5FADS antisense screening hit does 
not affect cell proliferation. 

Figure 32 demonstrates that Δ5FADS siRNAs inhibit Δ5FADS mRNA 
expression. 

Figure 33 demonstrates that Δ5FADS siRNAs inhibit haptotaxis. 

Figure 34 demonstrates that Δ5FADS siRNAs inhibit SDF-1 induced 
chemotaxis in Jurkat cells. 

Figure 35 demonstrates that Δ6FADS is a functional screening hit. The figure 
shows that Δ6FADS transcript is spliced to the KIAA1583 functional screen hit. 

Figure 36 demonstrates that Δ6FADS mRNA levels are strongly reduced in 
cells that express the antisense KIAA1583 functional screen hit. 

Figure 37 demonstrates that Δ6FADS siRNAs inhibit Δ6FADS mRNA expression and also 
reduce αvβ3 surface expression. 

DETAILED DESCRIPTION OF THE INVENTION 

Introduction 

Using a functional assay, angiogenesis regulatory proteins and nucleic acids 
have been identified and cloned from a green fluorescent protein (GFP) fusion library. 
Primary endothelial cells were transduced with retroviral cDNA/GFP fusion libraries. 
Transduced cells were selected and assayed for alterations consistent with inhibition of 
angiogenesis, e.g., downregulation of cell surface expression of αvβ3 or downregulation of 
haptotaxis. 

Using the screen described above, fifty-two proteins and/or DNA molecules 
were shown for the first time to regulate angiogenesis. Thirty-four of the proteins and/or 
DNA molecules are known, while fifteen of the proteins and/or DNA molecules are 
completely novel. However, this is the first demonstration of a function in regulation of 
angiogenesis for these molecules. 

The ABC transporter is involved in peroxisome biogenesis. Relevant 
sequence data for the protein, nucleic acids encoding the ABC transporter, and related 
sequences include the nucleic acid accession number NM_000033, SEQ ID NO:3; protein 
accession number NP_000024, SEQ ID NO:4; a new sequence determination, SEQ ID NO:2; 
and the functional screen hit, SEQ ID NO:1. The protein has an ATP-binding domain. The 
gene encoding the ABC transporter is linked to a genetic disease, adrenoleukodystrophy 
(Mosser et al., Nature 361:682-3 (1993)). An antisense version of the ABC transporter gene, 
e.g., SEQ ID NO:1, was identified as a functional hit in the screen for modulators of 
angiogenesis. 
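
By way of illustration only, the relationship between an antisense-orientation screen hit and the coding strand of a gene can be sketched as a reverse-complement check. The function names below are hypothetical and are not part of the disclosure; the sketch assumes plain A/C/G/T sequences and exact (ungapped) matching.

```python
# Illustrative sketch only; helper names are hypothetical, not from the disclosure.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence containing only A/C/G/T."""
    unexpected = set(seq) - set("ACGTacgt")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return seq.translate(_COMPLEMENT)[::-1]


def is_antisense_of(insert: str, reference: str) -> bool:
    """True if the insert reads as the antisense strand of part of the reference.

    Assumes an exact match; real inserts would normally be compared by alignment.
    """
    return reverse_complement(insert).upper() in reference.upper()
```

For example, `is_antisense_of(hit, reference_mrna)` would return True when a library insert is the exact reverse complement of a stretch of the reference transcript.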

PCB1, a poly(rC) binding protein, functions as a translational co-activator and 
also binds 3' UTRs of RNA, most likely through KH domains, thereby regulating RNA 
stability. Relevant sequence data for the protein, nucleic acids encoding PCB1, and related 
sequences include the nucleic acid accession number NM_006196, SEQ ID NO:32; protein 
accession number NP_006187, SEQ ID NO:33; and the functional screen hit, SEQ ID 
NO:5-31. An antisense version of the gene encoding PCB1 was identified as a functional hit in the 
screen for modulators of angiogenesis. 

SLC1A5, an amino acid transporter, has ten transmembrane spanning 
segments. Relevant sequence data for the protein, nucleic acids encoding SLC1A5, and 
related sequences include the nucleic acid accession number NM_005628, SEQ ID NO:43; 
protein accession number NP_005619, SEQ ID NO:44; and the functional screen hit, SEQ ID 
NO:34-42. An antisense version of the gene encoding SLC1A5 was identified as a functional 
hit in the screen for modulators of angiogenesis. 

Chromobox homolog 6 is involved in chromatin regulation. Relevant 
sequence data for the protein, nucleic acids encoding chromobox homolog 6, and related 
sequences include the nucleic acid accession number NM_014292, SEQ ID NO:57; protein 
accession number NP_055107, SEQ ID NO:58; and the functional screen hit, SEQ ID 
NO:45-56. An antisense version of chromobox homolog 6 was identified as a functional hit 
in the screen for modulators of angiogenesis. 

Cytochrome C oxidase subunit 1 is encoded by a mitochondrial gene and 
transfers electrons from cytochrome C to O2 (Ingman et al., Nature 408:708-713 (2000); 
Lehninger, Principles of Biochemistry (1984); Stryer, Biochemistry (1995)). Relevant 
sequence data for the protein, nucleic acids encoding cytochrome C oxidase, and related 
sequences include the nucleic acid accession number NC_001807, SEQ ID NO:63; protein 
accession number NP_536845, SEQ ID NO:64; and the functional screen hit, SEQ ID 
NO:59-62. A sense version of the gene encoding cytochrome C oxidase, encoding a 21 
amino acid peptide, was identified as a functional hit in the screen for modulators of 
angiogenesis. 

Δ-5 fatty acid desaturase (FADSD5 or Δ5FADS) introduces double bonds into 
a fatty acyl chain. The protein is involved in arachidonic acid synthesis and has a fatty acid 
desaturase domain and a heme-binding domain. Relevant sequence data for the protein, 
nucleic acids encoding Δ-5 fatty acid desaturase, and related sequences include the nucleic 
acid accession number NM_013402, SEQ ID NO:68; protein accession number NP_037534, 
SEQ ID NO:69; and the functional screen hit, SEQ ID NO:65-67. An antisense version of the 
Δ-5 fatty acid desaturase gene was identified as a functional hit in the screen for modulators of 
angiogenesis. 

The dynactin p27 subunit protein was shown to be an angiogenesis regulatory 
protein. The protein has an RGD domain. Relevant sequence data for the protein, and the 
nucleic acid encoding dynactin p27 subunit protein, include the nucleic acid accession 
number NC_006571, SEQ ID NO:70; and protein accession number NP_006562.1, SEQ ID 
NO:71. An antisense version of the gene encoding dynactin p27 subunit protein was 
identified as a functional hit in the screen for modulators of angiogenesis. 

Elongation factor 1 alpha (EF1α) was shown to be involved in regulation of 
angiogenesis. Relevant sequence data for the protein, nucleic acids encoding EF1α, and 
related sequences include the nucleic acid accession number NC_001402, SEQ ID NO:76; 

functional screen hit. Relevant sequence data for the protein, nucleic acids encoding HSPA5, 
and related sequences include the nucleic acid accession number NM_005347, SEQ ID 
NO:128; protein accession number NP_005338, SEQ ID NO:129; a new sequence 
determination, SEQ ID NO:127; and the functional screen hit, SEQ ID NO:122-126. 

Interferon gamma receptor 1 (IFNγR1) is a cytokine receptor that induces 
antiangiogenic genes. Relevant sequence data for the protein, nucleic acids encoding 
IFNγR1, and related sequences include the nucleic acid accession number NM_000416, SEQ 
ID NO:139; protein accession number NP_000407, SEQ ID NO:140; and the functional 
screen hit, SEQ ID NO:130-138. An antisense version of the IFNγR1 gene was identified as 
a functional hit in the screen for modulators of angiogenesis. 

Importin α4 (karyopherin α3) is a subunit of the nuclear localization signal 
receptor. Relevant sequence data for the protein, nucleic acids encoding importin α4, and 
related sequences include the nucleic acid accession number NM_002267, SEQ ID NO:153; 
protein accession number NP_002258.1, SEQ ID NO:154; and the functional screen hit, SEQ 
ID NO:141-152. An antisense version of the importin α4 gene was identified as a functional 
hit in the screen for modulators of angiogenesis. 

The lysosomal pepstatin-insensitive protease (CLN2) is encoded by a gene 
associated with juvenile neuronal ceroid lipofuscinosis disease. Relevant sequence data for 
the protein, nucleic acids encoding CLN2, and related sequences include the nucleic acid 
accession number NM_000391, SEQ ID NO:163; protein accession number NP_000382, 
SEQ ID NO:164; and the functional screen hit, SEQ ID NO:155-162. An antisense version 
of the CLN2 gene was identified as a functional hit in the screen for modulators of 
angiogenesis. 

A novel protein disulfide isomerase was shown to have a role in angiogenesis. 
Relevant sequence data for the nucleic acid encoding the novel protein disulfide isomerase 
include SEQ ID NO:165. An antisense version of the gene encoding the novel protein disulfide 
isomerase was identified as a functional hit in the screen for modulators of angiogenesis. The 
antisense hit encodes a 45 or 32 amino acid peptide that may also be useful as a therapeutic or 
to modulate angiogenesis. 
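
As a rough illustration of how a short peptide can be read off a library insert, the sketch below translates a DNA sequence in a chosen reading frame until the first stop codon. It assumes the standard genetic code and an insert expressed in frame with the fusion partner; the function name and the `frame` parameter are hypothetical and are not taken from the disclosure.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(seq: str, frame: int = 0) -> str:
    """Translate DNA from offset `frame` (0, 1, or 2) up to the first stop codon.

    Codons containing characters other than A/C/G/T are reported as 'X'.
    """
    if frame not in (0, 1, 2):
        raise ValueError("frame must be 0, 1, or 2")
    seq = seq.upper()
    peptide = []
    for i in range(frame, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":  # stop codon ends the peptide
            break
        peptide.append(amino_acid)
    return "".join(peptide)
```

The length of the returned string gives the peptide length for that frame, which is how a figure such as "a 45 or 32 amino acid peptide" could be checked against an insert sequence under these assumptions.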

Microtubule-associated protein 4 (MAP4) binds tubulin and may link 
microtubules to other elements of the cytoskeleton. See, e.g., Chapin & Bulinski, J. Cell Sci. 
98:27-36 (1991); and Chapin et al., Biochemistry 34:2289-2301 (1995). Four isoforms of 
MAP4 are known. Relevant sequence data for the proteins, nucleic acids encoding MAP4 
isoforms, and related sequences include the nucleic acid accession numbers NM_002375, 

Plasminogen activator inhibitor 1 (SERPINE1 or PAI-1) was identified as an 
angiogenesis regulatory protein. PAI-1 is a protease inhibitor. Relevant sequence data for 
the protein, nucleic acids encoding PAI-1, and related sequences include the nucleic acid 
accession number NM_000602, SEQ ID NO:419; protein accession number NP_000593, 
SEQ ID NO:420; and the functional screen hit, SEQ ID NO:410-418. The functional screen 
hit is in the antisense orientation. 

Proteosomal subunit Y, an IFNγ-regulated proteosomal subunit, was identified 
as an angiogenesis regulatory protein. Relevant sequence data for proteosomal subunit Y 
include the functional screen hit, SEQ ID NO:421. The functional screen hit is in the 
antisense orientation. 

Rap2B was identified as an angiogenesis regulatory protein. Rap2B is a 
member of the RAS family of GTPases. Relevant sequence data for the protein, nucleic acids 
encoding Rap2B, and related sequences include the nucleic acid accession number 
XM_003032, SEQ ID NO:428; protein accession number XP_003032, SEQ ID NO:429; and 
the functional screen hit, SEQ ID NO:422-427. The functional screen hit is in the antisense 
orientation. 

Semaphorin 3F was identified as an angiogenesis regulatory protein. 
Semaphorin 3F is a ligand for NP2. Relevant sequence data for the protein, nucleic acids 
encoding semaphorin 3F, and related sequences include the nucleic acid accession number 
U38276, SEQ ID NO:437; protein accession number AAB18276, SEQ ID NO:438; and the 
functional screen hit, SEQ ID NO:430-436. The functional screen hit is in the antisense 
orientation. 

SPARC was identified as an angiogenesis regulatory protein. Relevant 
sequence data for the protein, nucleic acids encoding SPARC, and related sequences include 
the nucleic acid accession number NM_003118, SEQ ID NO:439; and protein accession 
number NP_003109, SEQ ID NO:440. 

ssDNA binding protein-3 was identified as an angiogenesis regulatory protein. 
Relevant sequence data for the protein, nucleic acids encoding ssDNA binding protein-3, and 
related sequences include the nucleic acid accession number NM_018070, SEQ ID NO:445; 
protein accession number NP_060540, SEQ ID NO:446; and the functional screen hit, SEQ 
ID NO:441-444. The functional screen hit is in the sense orientation. 

Sumo protease (SUSP-1) was identified as an angiogenesis regulatory protein. 
Relevant sequence data for the protein, nucleic acids encoding SUSP-1, and related 
sequences include the nucleic acid accession number NM_015571, SEQ ID NO:456; protein 

The terms "angiogenesis regulatory protein or nucleic acid" or a nucleic acid 
encoding "angiogenesis regulatory protein" refer to nucleic acid and polypeptide 
polymorphic variants, alleles, mutants, and interspecies homologs that: (1) have an amino 
acid sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%, 
80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater 
amino acid sequence identity, preferably over a region of at least about 25, 
50, 100, 200, 500, 1000, or more amino acids, to a polypeptide encoded by a nucleic acid of 
SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID NO:63, SEQ ID 
NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, 
SEQ ID NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, 
SEQ ID NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, 
SEQ ID NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, 
SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, 
SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, 
SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, 
SEQ ID NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, 
SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, 
SEQ ID NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, 
SEQ ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, and 
SEQ ID NO:519, or an amino acid sequence of SEQ ID NO:4, SEQ ID NO:33, SEQ ID 
NO:44, SEQ ID NO:58, SEQ ID NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID NO:77, 
SEQ ID NO:82, SEQ ID NO:87, SEQ ID NO:90, SEQ ID NO:121, SEQ ID NO:129, SEQ ID 
NO:140, SEQ ID NO:154, SEQ ID NO:164, SEQ ID NO:170, SEQ ID NO:172, SEQ ID 
NO:174, SEQ ID NO:176, SEQ ID NO:184, SEQ ID NO:203, SEQ ID NO:287, SEQ ID 
NO:298, SEQ ID NO:309, SEQ ID NO:319, SEQ ID NO:325, SEQ ID NO:331, SEQ ID 
NO:341, SEQ ID NO:352, SEQ ID NO:366, SEQ ID NO:378, SEQ ID NO:385, SEQ ID 
NO:407, SEQ ID NO:409, SEQ ID NO:420, SEQ ID NO:429, SEQ ID NO:438, SEQ ID 
NO:440, SEQ ID NO:446, SEQ ID NO:457, SEQ ID NO:463, SEQ ID NO:482, SEQ ID 
NO:485, SEQ ID NO:494, SEQ ID NO:497, SEQ ID NO:499, SEQ ID NO:520, SEQ ID 
NO:522, and SEQ ID NO:524 or other sequences listed herein; (2) specifically bind to 
antibodies, e.g., polyclonal antibodies, raised against an immunogen comprising an amino 
acid sequence of SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44, SEQ ID NO:58, SEQ ID 
NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID NO:82, SEQ ID NO:87, 
SEQ ID NO:90, SEQ ID NO:121, SEQ ID NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ 
ID NO:164, SEQ ID NO:170, SEQ ID NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID 
NO:184, SEQ ID NO:203, SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID 
NO:319, SEQ ID NO:325, SEQ ID NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID 
NO:366, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID 
NO:420, SEQ ID NO:429, SEQ ID NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID 
NO:457, SEQ ID NO:463, SEQ ID NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID 
NO:497, SEQ ID NO:499, SEQ ID NO:520, SEQ ID NO:522, and SEQ ID NO:524, 
immunogenic fragments thereof, and conservatively modified variants thereof; (3) 
specifically hybridize under stringent hybridization conditions to a nucleic acid encoding 
SEQ ID NO:4, SEQ ID NO:33, SEQ ID NO:44, SEQ ID NO:58, SEQ ID NO:64, SEQ ID 
NO:69, SEQ ID NO:71, SEQ ID NO:77, SEQ ID NO:82, SEQ ID NO:87, SEQ ID NO:90, 
SEQ ID NO:121, SEQ ID NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ ID NO:164, 
SEQ ID NO:170, SEQ ID NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID NO:184, 
SEQ ID NO:203, SEQ ID NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID NO:319, 
SEQ ID NO:325, SEQ ID NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID NO:366, 
SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID NO:420, 
SEQ ID NO:429, SEQ ID NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:457, 
SEQ ID NO:463, SEQ ID NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID NO:497, 
SEQ ID NO:499, SEQ ID NO:520, SEQ ID NO:522, and SEQ ID NO:524, e.g., a nucleic 
acid sequence of SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ ID 
NO:63, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, 
SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ 
ID NO:163, SEQ ID NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID 
NO:175, SEQ ID NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID 
NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID 
NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID 
NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID 
NO:330, SEQ ID NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID 
NO:384, SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID 
NO:428, SEQ ID NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID 
NO:462, SEQ ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID 
NO:498, SEQ ID NO:519, SEQ ID NO:521, and SEQ ID NO:523; or its complement, and 
conservatively modified variants thereof; (4) have a nucleic acid sequence that has greater 
than about 95%, preferably greater than about 96%, 97%, 98%, 99%, or higher nucleotide 
sequence identity, preferably over a region of at least about 25, 50, 100, 200, 500, 1000, or 
more nucleotides, to SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID NO:57, SEQ 
ID NO:63, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID NO:81, SEQ ID NO:86, 
SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ 
ID NO:163, SEQ ID NO:165, SEQ ID NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID 
NO:175, SEQ ID NO:183, SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID 
NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID 
NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID 
NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID 
NO:330, SEQ ID NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID 
NO:384, SEQ ID NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID 
NO:428, SEQ ID NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID 
NO:462, SEQ ID NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID 
NO:498, SEQ ID NO:519, SEQ ID NO:521, and SEQ ID NO:523, or the complement of any 
of those sequences. A polynucleotide or polypeptide sequence is typically from a mammal 
including, but not limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, 
horse, sheep, or any mammal. The nucleic acids and proteins of the invention include both 
naturally occurring and recombinant molecules. When available, accession numbers for the 
human angiogenesis regulatory proteins and genes are provided. 
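
The percent identity thresholds recited above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned to equal length, with `-` marking gaps, and it counts a gap opposite a residue as a mismatch; other tools use different denominators, so this is one convention among several and not necessarily the one intended here. The function names and default thresholds are hypothetical choices for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of compared alignment columns in which both sequences agree.

    Columns where both sequences have a gap are ignored; a gap opposite a
    residue counts as a mismatch (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_identity(aligned_a: str, aligned_b: str,
                   min_percent: float = 60.0, min_region: int = 25) -> bool:
    """True if identity is at least min_percent over a region of at least min_region columns."""
    region = sum(1 for x, y in zip(aligned_a, aligned_b)
                 if not (x == "-" and y == "-"))
    return region >= min_region and percent_identity(aligned_a, aligned_b) >= min_percent
```

With the defaults, `meets_identity` mirrors the broadest amino acid threshold in clause (1) (greater than about 60% over at least about 25 residues); passing `min_percent=95.0` would correspond to the nucleotide threshold in clause (4).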

The phrase "functional effects" in the context of assays for testing compounds 
that modulate activity of an angiogenesis regulatory protein includes the determination of a 
parameter that is indirectly or directly under the influence of an angiogenesis regulatory 
protein polypeptide, e.g., a chemical or phenotypic effect such as a loss-of-angiogenesis 
phenotype represented by a change in expression of a cell surface marker, such as αvβ3 
integrin, or changes in cellular proliferation, especially endothelial cell proliferation; or 
enzymatic activity, or, e.g., a physical effect such as ligand binding or inhibition of ligand 
binding. A functional effect therefore includes ligand binding activity, the ability of cells to 
proliferate, expression in cells undergoing angiogenesis, and other characteristics of 
angiogenic cells. "Functional effects" include in vitro, in vivo, and ex vivo activities. 
Angiogenesis assays are described, e.g., in Angiogenesis Protocols (Murray, ed., 2001). 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of an 
angiogenesis regulatory protein, e.g., measuring physical and chemical or phenotypic 
effects. Such functional effects can be measured by any means known to those skilled in the 

family, the integrin family, the selectin family, and the like; see, e.g., Pigott & Power, The 
Adhesion Molecule Facts Book I (1993). Similarly, toxins and venoms, viral epitopes, 
hormones (e.g., opiates, steroids, etc.), intracellular receptors (e.g., which mediate the effects 
of various small ligands, including steroids, thyroid hormone, retinoids and vitamin D; 
peptides), drugs, lectins, sugars, nucleic acids (both linear and cyclic polymer 
configurations), oligosaccharides, proteins, phospholipids and antibodies can all interact with 
various cell receptors. 

Synthetic polymers, such as polyurethanes, polyesters, polycarbonates, 
polyureas, polyamides, polyethyleneimines, polyarylene sulfides, polysiloxanes, polyimides, 
and polyacetates can also form an appropriate tag or tag binder. Many other tag/tag binder 
pairs are also useful in assay systems described herein, as would be apparent to one of skill 
upon review of this disclosure. 

Common linkers such as peptides, polyethers, and the like can also serve as 
tags, and include polypeptide sequences, such as poly Gly sequences of between about 5 and 
200 amino acids (SEQ ID NO:526). Such flexible linkers are known to persons of skill in the 
art. For example, poly(ethylene glycol) linkers are available from Shearwater Polymers, Inc., 
Huntsville, Alabama. These linkers optionally have amide linkages, sulfhydryl linkages, or 
heterofunctional linkages. 

Tag binders are fixed to solid substrates using any of a variety of methods 
currently available. Solid substrates are commonly derivatized or functionalized by exposing 
all or a portion of the substrate to a chemical reagent which fixes a chemical group to the 
surface which is reactive with a portion of the tag binder. For example, groups which are 
suitable for attachment to a longer chain portion would include amines, hydroxyl, thiol, and 
carboxyl groups. Aminoalkylsilanes and hydroxyalkylsilanes can be used to functionalize a 
variety of surfaces, such as glass surfaces. The construction of such solid phase biopolymer 
arrays is well described in the literature. See, e.g., Merrifield, J. Am. Chem. Soc. 
85:2149-2154 (1963) (describing solid phase synthesis of, e.g., peptides); Geysen et al., J. Immun. 
Meth. 102:259-274 (1987) (describing synthesis of solid phase components on pins); Frank & 
Doring, Tetrahedron 44:6031-6040 (1988) (describing synthesis of various peptide sequences 
on cellulose disks); Fodor et al., Science 251:767-777 (1991); Sheldon et al., Clinical 
Chemistry 39(4):718-719 (1993); and Kozal et al., Nature Medicine 2(7):753-759 (1996) (all 
describing arrays of biopolymers fixed to solid substrates). Non-chemical approaches for 
fixing tag binders to substrates include other common methods, such as heat, cross-linking by 
UV radiation, and the like. 

EKGLRERAYGSSGELITSLPISIPLSTVQPNKLPVSIPLASWLPSRAERARSTPSPVLQPRDPSSTLEK 
QIGANAHGAGSRSLALAPAGFSYAGSVAISGALAGSPASLTPGAEPATLDESSSSGSLFATVGSRSSTPQ 
HPLLLAQPRNSLPASPAHQLSSSPRLGGAAQGPLPEASKGDLPSDSGFSDPESEAKRRIVFTITTGAGSA 
KQSPSSKHSPLTASARGDCVPSHGQDSRRRGRRKRASAGTPSLSAGVSPKRJtAIiPSVAGIiFTQPSGSPIi^ 
LNSMVSNINQPLEITAISSPETSLKSSPVPYQDHDQPPVLKKERPLSQTNGAHYSPLTSDEEPGSEDEPS 
SARIERKIATISLESKSPPKTLENGGGLAGRKPAPAGEPVNSSKWKSTFSPISDIGLAKSADSPLQASSA 
LSQNSLFTFRPALEEPSADAKLAAHPRKGFPGSLSGADGLSPGTNPANGCTFGGGLAADLSLHSFSDGAS 
LPHKGPEAAGLSSPLSFPSQRGKEGSDANPFLSKRQLDGLAGLKGEGSRGKEAGEGGLPLCGPTDKTPLL 
SGKAAKARDREVDLKNGHNLFISAAAVPPGSLLSGPGLAPAASSAGGAASSAQTHRSFLGPFPPGPQFAL 
GPMSLQANLGSVAGSSVLQSLFSSVPAAAGLVHVSSAATRLTNSHAMGSFSGVAGGTVGGVFNHAVPSAS 
AHPFGARVGRGAACGSATLGPSPLQAAASASASSFQAPASVETRPPPPPPPPPPPLPPPAHLGRSPAGPP 
VLHAPPPPNAALPPPPTLLASNPEPALLQSLASLPPNQAFLPPTSAASLPPANASLSIKIiTSLPHKGARP 
SFTVHHQPLPRLALAQAAPGIPQASATGPSAVWVSLGMPPPYAAHLSGVKPR 

SEQ ID NO:320 

Novel (maps to chromosome 4) 
Novel (Chromosome 4) > 

GGAATAAAAGAGTGGAAATGGGGATTTCCAGGTGCTCCCCTGGTTCATCTAGGCACCAGAGAGCTGCACTAGCAG 
GTCTATCATGAATCTCCTTGGAATGCTCATTTTTAGTCCTACTTGATGTGTCTGTTTCTGGAAATGCAGTATTTT 
TAATGTATCTCAACAAAATATTTTATGATTAGTAAGCTTATTCTTATATAAAGGACAATTTTTTTCCTTTTTCAC 
AGGTTCTAATAATTTTTTATTTAATAATTAGATCTATTAGATTTTATTCATAACTGTGGTAGTTGAAGTACCTTC 
TAAGCTGAGTTCAGATTTGAGAATAAACCTTGGGGTATCATTACAGAAAATTTTGTCTCAATCTGCTTTGTATTT 
GAAAGATATGAGATTCTTGAATTATATATCTTACAGACTAGTCCCCAAAAGAATACGTGTTTCCTTACCTTTAAT 
TTCTCATGGTAGTTAGTCTGTGAAT 

SEQ ID NO:321

Novel peroxidasin-like/ melanoma antigen 
>GL2-86-4M13F Direction: sense 

CGACCTGGCCAGCCACCGCGGCCTGCTGCGGCAGGGCATCGTGCAGCGGTCCGGGAAGCCGCTGCTCCCCTTCGC 
CACCGGGCCGCCCACGGAGTGCATGCGGGACGAGAACGAGAGCCCCATCCCCTGCTTCCTGGCCGGGGACCACCG 
CGCCAACGAGCAGCTGGGCCTGACCAGCATGCACACGCTGTGGTTCCGCGAGCACAACCGCATTGCCACGGAGCT 
GCTCAAGCTGAACCCGCACTGGGACGGCGACACCATCTACTATGAGACCAGGAAGATCGTGGGTGCGGAGATCCA 
GCACATCACCTACCAGCACTGGCTCCCGAAGATCCTGGGGGAGGTGGGCATGAGGACGCTGGGAGAGTACCACGG 
CTACGACCCCGGCATCAATGCTGGCATCTTCAACGCCTTCGCCACCGCGGCCTTCAGGTTTGGCCACACGCTTGT 
CAACCCACTGCTTTACCGGCTGGACGAGAACTTCCAGCCCATTGCACAAGATCACCTCCCCCTTCACAAAGCTTT 
CTTCTCTCCCTTCCGGATTG 

SEQ ID NO:322 

>GL2_93_2B08_G3F1 Direction: sense

CGACCTGGCCAGCCACCGCGGCCTGCTGCGGCAGGGCATCGTGCAGCGGTCCGGGAAGCCGCTGCTCCCCTTCGC 
CACCGGGCCGCCCACGGAGTGCATGCGGGACGAGAACGAGAGCCCCATCCCCTGCTTCCTGGCCGGGGACCACCG 
CGCCAACGAGCAGCTGGGCCTGACCAGCATGCACACGCTGTGGTTCCGCGAGCACAACCGCATTGCCACGGAGCT 
GCTCAAGCTGAACCCGCACTGGGACGGCGACACCATCTACTATGAGACCAGGAAGATCGTGGGTGCGGAGATCCA 
GCACATCACCTACCAGCACTGGCTCCCGAAGATCCTGGGGGAGGTGGGCATGAGGACGCTGGGAGAGTACCACGG 
CTACGACCCCGGCATCAATGCTGGCATCTTCAACGCCTTCGCCACCGC 

SEQ ID NO:323 

Novel peroxidasin-like/ melanoma antigen> 

CGACCTGGCCAGCCACCGCGGCCTGCTGCGGCAGGGCATCGTGCAGCGGTCCGGGAAGCCGCTGCTCCCCTTCGC 
CACCGGGCCGCCCACGGAGTGCATGCGGGACGAGAACGAGAGCCCCATCCCCTGCTTCCTGGCCGGGGACCACCG 
CGCCAACGAGCAGCTGGGCCTGACCAGCATGCACACGCTGTGGTTCCGCGAGCACAACCGCATTGCCACGGAGCT 
GCTCAAGCTGAACCCGCACTGGGACGGCGACACCATCTACTATGAGACCAGGAAGATCGTGGGTGCGGAGATCCA 
GCACATCACCTACCAGCACTGGCTCCCGAAGATCCTGGGGGAGGTGGGCATGAGGACGCTGGGAGAGTACCACGG 
CTACGACCCCGGCATCAATGCTGGCATCTTCAACGCCTTCGCCACCGCGGCCTTCAGGTTTGGCCACACGCTTGT 
CAACCCACTGCTTTACCGGCTGGACGAGAACTTCCAGCCCATTGCACAAGATCACCTCCCCCTTCACAAAGCTTT 
CTTCTCTCCCTTCCGGATTGTGAATGAGGGCGGCATCGATCCGCTTCTCAGGGGGCTGTTCGGGGTGGCGGGGAA 
AATGCGTGTGCCCTCGCAGCTGCTGAACACGGAGCTCACGGAGCGGCTGTTCTCCATGGCACACACGGTGGCTCT 
GGACCTGGCGGCCATCAACATCCAGCGGGGCCGGGACCACGGGATCCCACCCTACCACGACTACAGGGTCTACTG 
CAATCTATCGGCGGCACACACGTTCGAGGACCTGAAAAATGAGATTAAAAACCCTGAGATCCG 
WHAT IS CLAIMED IS:

1. A method for identifying a compound that modulates angiogenesis,
the method comprising the steps of:
(i) contacting the compound with a nucleic acid, or a polypeptide or a
fragment thereof encoded by a nucleic acid, wherein the nucleic acid hybridizes under
stringent conditions to a second nucleic acid comprising a nucleotide sequence selected
from the group consisting of SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID
NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID
NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID
NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID NO:169, SEQ ID
NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID NO:202, SEQ ID
NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID
NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID
NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID
NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID NO:351, SEQ ID
NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID NO:408, SEQ ID
NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID NO:439, SEQ ID
NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID NO:484, SEQ ID
NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID NO:521, and SEQ
ID NO:523; and
(ii) determining the functional effect of the compound upon the nucleic
acid or polypeptide.

2. The method of claim 1, wherein the functional effect is determined
in vitro.

3. The method of claim 2, wherein the functional effect is a physical
effect.

4. The method of claim 2, wherein the functional effect is determined
by measuring ligand or substrate binding to the polypeptide.

5. The method of claim 2, wherein the functional effect is a chemical
effect.
ID NO:76, SEQ ID NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID
NO:128, SEQ ID NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID
NO:169, SEQ ID NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID
NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID
NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID
NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID
NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID
NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID
NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID
NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID
NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID
NO:521, and SEQ ID NO:523.

19. The method of claim 1, wherein the polypeptide comprises a
sequence selected from the group consisting of SEQ ID NO:4, SEQ ID NO:33, SEQ ID
NO:44, SEQ ID NO:58, SEQ ID NO:64, SEQ ID NO:69, SEQ ID NO:71, SEQ ID
NO:77, SEQ ID NO:82, SEQ ID NO:87, SEQ ID NO:90, SEQ ID NO:121, SEQ ID
NO:129, SEQ ID NO:140, SEQ ID NO:154, SEQ ID NO:164, SEQ ID NO:170, SEQ ID
NO:172, SEQ ID NO:174, SEQ ID NO:176, SEQ ID NO:184, SEQ ID NO:203, SEQ ID
NO:287, SEQ ID NO:298, SEQ ID NO:309, SEQ ID NO:319, SEQ ID NO:325, SEQ ID
NO:331, SEQ ID NO:341, SEQ ID NO:352, SEQ ID NO:366, SEQ ID NO:378, SEQ ID
NO:385, SEQ ID NO:407, SEQ ID NO:409, SEQ ID NO:420, SEQ ID NO:429, SEQ ID
NO:438, SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:457, SEQ ID NO:463, SEQ ID
NO:482, SEQ ID NO:485, SEQ ID NO:494, SEQ ID NO:497, SEQ ID NO:499, SEQ ID
NO:520, SEQ ID NO:522, and SEQ ID NO:524.



20. The method of claim 1, wherein the compound is an antibody.

21. The method of claim 1, wherein the compound is an antisense
molecule.

22. The method of claim 1, wherein the compound is a small organic
molecule.

23. The method of claim 1, wherein the compound is a peptide.
24. A method for identifying a compound that modulates angiogenesis,
the method comprising the steps of:
(i) contacting the compound with a nucleic acid, or a polypeptide or a
fragment thereof encoded by a nucleic acid, wherein the nucleic acid hybridizes under
stringent conditions to a second nucleic acid comprising a nucleotide sequence selected
from the group consisting of SEQ ID NO:3, SEQ ID NO:32, SEQ ID NO:43, SEQ ID
NO:57, SEQ ID NO:63, SEQ ID NO:68, SEQ ID NO:70, SEQ ID NO:76, SEQ ID
NO:81, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:120, SEQ ID NO:128, SEQ ID
NO:139, SEQ ID NO:153, SEQ ID NO:163, SEQ ID NO:165, SEQ ID NO:169, SEQ ID
NO:171, SEQ ID NO:173, SEQ ID NO:175, SEQ ID NO:183, SEQ ID NO:202, SEQ ID
NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID
NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID
NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID
NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:340, SEQ ID NO:351, SEQ ID
NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID NO:406, SEQ ID NO:408, SEQ ID
NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID NO:437, SEQ ID NO:439, SEQ ID
NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID NO:481, SEQ ID NO:484, SEQ ID
NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID NO:519, SEQ ID NO:521, and SEQ
ID NO:523;
(ii) determining the functional effect of the compound upon the nucleic
acid or polypeptide; and
(iii) expressing the nucleic acid or polypeptide in a cell, contacting the
nucleic acid or polypeptide with the compound, and determining the phenotypic or
chemical effect upon the cell.

25. A method of modulating angiogenesis in a subject, the method
comprising the step of administering to the subject a therapeutically effective amount of a
compound identified using the method of claim 1.

26. The method of claim 25, wherein the subject is a human.

27. The method of claim 25, wherein the compound is an antibody.

28. The method of claim 25, wherein the compound is an antisense
molecule.
NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, SEQ ID NO:330, SEQ ID
NO:340, SEQ ID NO:351, SEQ ID NO:365, SEQ ID NO:377, SEQ ID NO:384, SEQ ID
NO:406, SEQ ID NO:408, SEQ ID NO:419, SEQ ID NO:421, SEQ ID NO:428, SEQ ID
NO:437, SEQ ID NO:439, SEQ ID NO:445, SEQ ID NO:456, SEQ ID NO:462, SEQ ID
NO:481, SEQ ID NO:484, SEQ ID NO:493, SEQ ID NO:496, SEQ ID NO:498, SEQ ID
NO:519, SEQ ID NO:521, and SEQ ID NO:523.

35. An isolated nucleic acid, wherein the nucleic acid hybridizes
under stringent conditions to a polynucleotide comprising a nucleotide sequence selected
from the group consisting of SEQ ID NO:165, SEQ ID NO:202, SEQ ID NO:210, SEQ
ID NO:218, SEQ ID NO:227, SEQ ID NO:232, SEQ ID NO:248, SEQ ID NO:274, SEQ
ID NO:285, SEQ ID NO:286, SEQ ID NO:297, SEQ ID NO:307, SEQ ID NO:308, SEQ
ID NO:317, SEQ ID NO:318, SEQ ID NO:320, SEQ ID NO:323, SEQ ID NO:324, SEQ
ID NO:329, and SEQ ID NO:330.

36. The isolated nucleic acid of claim 35, wherein the nucleic acid
comprises a nucleotide sequence selected from the group consisting of SEQ ID NO:165,
SEQ ID NO:202, SEQ ID NO:210, SEQ ID NO:218, SEQ ID NO:227, SEQ ID NO:232,
SEQ ID NO:248, SEQ ID NO:274, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:297,
SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:320,
SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:329, and SEQ ID NO:330.

37. An isolated polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO:287, SEQ ID NO:298, SEQ ID
NO:309, SEQ ID NO:319, SEQ ID NO:325, and SEQ ID NO:331.
INFORMAL SEQUENCE LISTING

SEQ ID NO:1

ABC Transporter

GL2_96_2C01_G3F1

ATGTAGAACATGCGCTGGGGTGGGGGCTTGTAGAGCACACCACCGTACGTGGGCCAGAGCCCACCCAGGATCCGG 
AATAGGGAGCTCTTGCCGCAGCCATTGGGGCCTGTGATGAGCAGATGCATGCCTATCCTCCACCTGATGTTGAGG 
CTGGGGCACCACAACTCTCCTGANGGCGTGAACGAATCGGGAATGTTTCCATCCCGTGCGAAGGAAGTCTNNGTG 

TCGACGCGCTCTCGNCTCATCACAACACTCCAACAAACACGTGGGCTCGGAATCTTCAAGGGGGGGGCCATCCAG
AAAACAGCGAACAAACCACTGAACCTGGGGCGTAGATGAGGCTCNCACCCAAGATAAAACACCCCGGGGCGCCTT 
GGGAAGACCCGGGTCTCCCGTCCCTTAANACGTTCTCCCCATGTGGCGGGGCCCACTCCCTTCTTGGTNAC^ACN 
GGCTTGGCACCAAGACCAGGCCGCTCGCAACACCGCAGTCCGCGTTTTTCTCAGACAGCAAAGTTTANCNCCCCT 
TTGTAGGAAGAAAACCCAAAGAGTTTCCGATTCCACAGGAGTTTGGGCCGCAATGCAACAGNCCCAGGAGCGGGG 

NNGCCGCACTTCGAGTCTAGCGGATTTATATATAGACACGCCGCAACTAGNGAGCGTCCCCTCGAAAAGAGCCCN

AGNTTACACNCCCGGGGGGTATCCGGTCAAAAGCAGCCCNTTACCCTCCCCTTNTTTTGGTGGTTAATACCGGCG 
GAAAAAAC 

SEQ ID NO:2 
ABC transporter>

GGACGCGCCTGGTGCCCCGGGGAGGGGCGCCACCGGGGGAGGAGGAGGAG 

GAGAAGGTGGAGAGGAAGAGACGCCCCCTCTGCCCGAGACCTCTCAAGGC 

CCTGACCTCAGGGGCCAGGGCACTGACAGGACAGGAGAGCCAAGTTCCTC 

CACTTGGGCTGCCCGAAGAGGCCGCGACCCTGGAGGGCCCTGAGCCCACC 
GCACCAGGGGCCCCAGCACCACCCCGGGGGCCTAAAGCGACAGTCTCAGG

GGCCATCGCAAGGTTTCCAGTTGCCTAGACAACAGGCCCAGGGTCAGAGC 

AACAATCCTTCCAGCCACCTGCCTCAACTGCTGCCCCAGGCACCAGCCCC 

AGTCCCTACGCGGCAGCCAGCCCAGGTGACATGCCGGTGCTCTCCAGGCC 

CCGGCCCTGGCGGGGGAACACGCTGAAGCGCACGGCCGTGCTCCTGGCCC 
TCGCGGCCTATGGAGCCCACAAAGTCTACCCCTTGGTGCGCCAGTGCCTG

GCCCCGGCCAGGGGTCTTCAGGCGCCCGCCGGGGAGCCCACGCAGGAGGC 

CTCCGGGGTCGCGGCGGCCAAAGCTGGCATGAACCGGGTATTCCTGCAGC 

GGCTCCTGTGGCTCCTGCGGCTGCTGTTCCCCCGGGTCCTGTGCCGGGAG 

ACGGGGCTGCTGGCCCTGCACTCGGCCGCCTTGGTGAGCCGCACCTTCCT 
GTCGGTGTATGTGGCCCGCCTGGACGGAAGGCTGGCCCGCTGCATCGTCC

GCAAGGACCCGCGGGCTTTTGGCTGGCAGCTGCTGCAGTGGCTCCTCATC 

GCCCTCCCTGCTACCTTCGTCAACAGTGCCATCCGTTACCTGGAGGGCCA 

ACTGGCCCTGTCGTTCCGCAGCCGTCTGGTGGCCCACGCCTACCGCCTCT 

ACTTCTCCCAGCAGACCTACTACCGGGTCAGCAACATGGACGGGCGGCTT 


CGCAACCCTGACCAGTCTCTGACGGAGGACGTGGTGGCCTTTGCGGCCTC 

TGTGGCCCACCTCTACTCCAACCTGACCAAGCCACTCCTGGACGTGGCTG 

TGACTTCCTACACCCTGCTTCGGGCGGCCCGCTCCCGTGGAGCCGGCACA 

GCCTGGCCCTCGGCCATCGCCGGCCTCGTGGTGTTCCTCACGGCCAACGT 

GCTGCGGGCCTTCTCGCCCAAGTTCGGGGAGCTGGTGGCAGAGGAGGCGC 
GGCGGAAGGGGGAGCTGCGCTACATGCACTCGCGTGTGGTGGCCAACTCG

GAGGAGATCGCCTTCTATGGGGGCCATGAGGTGGAGCTGGCCCTGCTACA 

GCGCTCCTACCAGGACCTGGCCTCGCAGATCAACCTCATCCTTCTGGAAC 

GCCTGTGGTATGTTATGCTGGAGCAGTTCCTCATGAAGTATGTGTGGAGC 

GCCTCGGGCCTGCTCATGGTGGCTGTCCCCATCATCACTGCCACTGGCTA 
CTCAGAGTCAGATGCAGAGGCCGTGAAGAAGGCAGCCTTGGAAAAGAAGG

AGGAGGAGCTGGTGAGCGAGCGCACAGAAGCCTTCACTATTGCCCGCAAC 

CTCCTGACAGCGGCTGCAGATGCCATTGAGCGGATCATGTCGTCGTACAA 

GGAGGTGACGGAGCTGGCTGGCTACACAGCCCGGGTGCACGAGATGTTCC 

AGGTATTTGAAGATGTTCAGCGCTGTCACTTCAAGAGGCCCAGGGAGCTA 
GAGGACGCTCAGGCGGGGTCTGGGACCATAGGCCGGTCTGGTGTCCGTGT

GGAGGGCCCCCTGAAGATCCGAGGCCAGGTGGTGGATGTGGAACAGGGGA 

TCATCTGCGAGAACATCCCCATCGTCACGCCCTCAGGAGAGGTGGTGGTG 

GCCAGCCTCAACATCAGGGTGGAGGAAGGCATGCATCTGCTCATCACAGG 
CCCCAATGGCTGCGGCAAGAGCTCCCTGTTCCGGATCCTGGGTGGGCTCT
GGCCCACGTACGGTGGTGTGCTCTACAAGCCCCCACCCCAGCGCATGTTC 
TACATCCCGCAGAGGCCCTACATGTCTGTGGGCTCCCTGCGTGACCAGGT 
GATCTACCCGGACTCAGTGGAGGACATGCAAAGGAAGGGCTACTCGGAGC 
AGGACCTGGAAGCCATCCTGGACGTCGTGCACCTGCACCACATCCTGCAG
CGGGAGGGAGGTTGGGAGGCTATGTGTGACTGGAAGGACGTCCTGTCGGG 

TGGCGAGAAGCAGAGAATCGGCATGGCCCGCATGTTCTACCACAGGCCCA

AGTACGCCCTCCTGGATGAATGCACCAGCGCCGTGAGCATCGACGTGGAA 

GGCAAGATCTTCCAGGCGGCCAAGGACGCGGGCATTGCCCTGCTCTCCAT 
CACCCACCGGCCCTCCCTGTGGAAATACCACACACACTTGCTACAGTTCG

ATGGGGAGGGCGGCTGGAAGTTCGAGAAGCTGGACTCAGCTGCCCGCCTG 

AGCCTGACGGAGGAGAAGCAGCGGCTGGAGCAGCAGCTGGCGGGCATTCC 

CAAGATGCAGCGGCGCCTCCAGGAGCTCTGCCAGATCCTGGGCGAGGCCG 

TGGCCCCAGCGCATGTGCCGGCACCTAGCCCGCAAGGCCCTGGTGGCCTC 
CAGGGTGCCTCCACCTGACACAACCGTCCCCGGCCCCTGCCCCGCCCCCA

AGCTCGGATCACATGAAGGAGACAGCAGCACCCACCCATGCACGCACCCC 

GCCCCTGCATGCCTGGCCCCTCCTCCTAGAAAACCCTTCCCGCCCTCGGG 

AAAGTAGATGTGGAGGGTGGCGCCCTGCGTAACCCTCGCCCTGTCCCTCC 

CACTCCCTGGGGGCGCTGTTCCACAGTGACTGGGCCCTGTCCAGGGCAGT 
GAGTCCTCTACTTTGCTCCGTGGAGGAAGCTGGGGTACAAGGGGCCCAGT

GCTGGCCACACAGCAGCGCAGCCGAGCCCCAGGAGCCCGTCAGGCCACAG

CCCCTGGCACTGCAGGTGGCCTCCCTCCAGAGACTCGAGTCCCCATGATT 

CCCTCCTCGTCAGTCTCTCAAAGACCCCATGGTCCATCCCCTGAGGGTGG 

TCAGCCAAGGCTCCCGTTCCGTGGGATGCCATAAAAGCCGCCCAGTGGGA 
CCCACAGTCACACAGAGCGCCTCACCTGCATCCTCTCCCCCACAAGAGCC

CCAAAGATCCCACGGGAGAGGGGAGAGGGACGCACAGCACTGCCTGCCAA 

GCGAGAATGCAGGCCCCGCCCCCTCGGCCCCTCACCACCTCTTTCTACAG 

CCTAATTTATTGGATTCCCTATTCGTAGCCATCTCCGTGGCCAATGTGAC 

TACCGTGCCAGCAGCGGGGGCGGCCCAGCCTCTGAGTCCCGTGGGGCCCC 
GGCTCCCACCGGTGCCAAACCCAGCCCCTGCGGCCGTCACCCCGCCAGCC

TACACTGCCAGCCGCCACCGGGGCACACGGGCCTCTGCTTGCCAGCCAGG 

AGTGCGGACACCATGTTCCCAGCTCAGTGCCAAAGAGGGGTCACCAGGGG 

GAGCTGTCTGCGGAGCCAGCGCCTGCCCGAGAGAGACCCCACCGCCACCG 

TGTGCCTTTCCCGGGCCCTCAGCCCTCGGGCCGGGCACCACCCCCAGTCC 
CCCCAGTAAAAGCCTCCACTGGCAAAAAAAAAAAAAAAAAAAA



>gi|7262392|ref|NM_000033.2| Homo sapiens ATP-binding cassette, sub-family
D (ALD), member 1 (ABCD1), mRNA
GCGGACGGACGCGCCTGGTGCCCCGGGGAGGGGCGCCACCGGGGGAGGAGGAGGAGGAGAAGGTGGAGAG
GAAGAGACGCCCCCTCTGCCCGAGACCTCTCAAGGCCCTGACCTCAGGGGCCAGGGCACTGACAGGACAG 
GAGAGCCAAGTTCCTCCACTTGGGCTGCCCGAAGAGGCCGCGACCCTGGAGGGCCCTGAGCCCACCGCAC 
CAGGGGCCCCAGCACCACCCCGGGGGCCTAAAGCGACAGTCTCAGGGGCCATCGCAAGGTTTCCAGTTGC 
CTAGACAACAGGCCCAGGGTCAGAGCAACAATCCTTCCAGCCACCTGCCTCAACTGCTGCCCCAGGCACC 
AGCCCCAGTCCCTACGCGGCAGCCAGCCCAGGTGACATGCCGGTGCTCTCCAGGCCCCGGCCCTGGCGGG
GGAACACGCTGAAGCGCACGGCCGTGCTCCTGGCCCTCGCGGCCTATGGAGCCCACAAAGTCTACCCCTT 
GGTGCGCCAGTGCCTGGCCCCGGCCAGGGGTCTTCAGGCGCCCGCCGGGGAGCCCACGCAGGAGGCCTCC 
GGGGTCGCGGCGGCCAAAGCTGGCATGAACCGGGTATTCCTGCAGCGGCTCCTGTGGCTCCTGCGGCTGC 
TGTTCCCCCGGGTCCTGTGCCGGGAGACGGGGCTGCTGGCCCTGCACTCGGCCGCCTTGGTGAGCCGCAC 
CTTCCTGTCGGTGTATGTGGCCCGCCTGGACGGAAGGCTGGCCCGCTGCATCGTCCGCAAGGACCCGCGG
GCTTTTGGCTGGCAGCTGCTGCAGTGGCTCCTCATCGCCCTCCCTGCTACCTTCGTCAACAGTGCCATCC 
GTTACCTGGAGGGCCAACTGGCCCTGTCGTTCCGCAGCCGTCTGGTGGCCCACGCCTACCGCCTCTACTT 
CTCCCAGCAGACCTACTACCGGGTCAGCAACATGGACGGGCGGCTTCGCAACCCTGACCAGTCTCTGACG 
GAGGACGTGGTGGCCTTTGCGGCCTCTGTGGCCCACCTCTACTCCAACCTGACCAAGCCACTCCTGGACG 
TGGCTGTGACTTCCTACACCCTGCTTCGGGCGGCCCGCTCCCGTGGAGCCGGCACAGCCTGGCCCTCGGC
CATCGCCGGCCTCGTGGTGTTCCTCACGGCCAACGTGCTGCGGGCCTTCTCGCCCAAGTTCGGGGAGCTG 
GTGGCAGAGGAGGCGCGGCGGAAGGGGGAGCTGCGCTACATGCACTCGCGTGTGGTGGCCAACTCGGAGG 
AGATCGCCTTCTATGGGGGCCATGAGGTGGAGCTGGCCCTGCTACAGCGCTCCTACCAGGACCTGGCCTC 
GCAGATCAACCTCATCCTTCTGGAACGCCTGTGGTATGTTATGCTGGAGCAGTTCCTCATGAAGTATGTG 
TGGAGCGCCTCGGGCCTGCTCATGGTGGCTGTCCCCATCATCACTGCCACTGGCTACTCAGAGTCAGATG
CAGAGGCCGTGAAGAAGGCAGCCTTGGAAAAGAAGGAGGAGGAGCTGGTGAGCGAGCGCACAGAAGCCTT 



SEQ ID NO:3 
CACTATTGCCCGCAACCTCCTGACAGCGGCTGCAGATGCCATTGAGCGGATCATGTCGTCGTACAAGGAG
GTGACGGAGCTGGCTGGCTACACAGCCCGGGTGCACGAGATGTTCCAGGTATTTGAAGATGTTCAGCGCT 
GTCACTTCAAGAGGCCCAGGGAGCTAGAGGACGCTCAGGCGGGGTCTGGGACCATAGGCCGGTCTGGTGT 
CCGTGTGGAGGGCCCCCTGAAGATCCGAGGCCAGGTGGTGGATGTGGAACAGGGGATCATCTGCGAGAAC 
ATCCCCATCGTCACGCCCTCAGGAGAGGTGGTGGTGGCCAGCCTCAACATCAGGGTGGAGGAAGGCATGC
ATCTGCTCATCACAGGCCCCAATGGCTGCGGCAAGAGCTCCCTGTTCCGGATCCTGGGTGGGCTCTGGCC 
CACGTACGGTGGTGTGCTCTACAAGCCCCCACCCCAGCGCATGTTCTACATCCCGCAGAGGCCCTACATG 
TCTGTGGGCTCCCTGCGTGACCAGGTGATCTACCCGGACTCAGTGGAGGACATGCAAAGGAAGGGCTACT 
CGGAGCAGGACCTGGAAGCCATCCTGGACGTCGTGCACCTGCACCACATCCTGCAGCGGGAGGGAGGTTG 
GGAGGCTATGTGTGACTGGAAGGACGTCCTGTCGGGTGGCGAGAAGCAGAGAATCGGCATGGCCCGCATG
TTCTACCACAGGCCCAAGTACGCCCTCCTGGATGAATGCACCAGCGCCGTGAGCATCGACGTGGAAGGCA 
AGATCTTCCAGGCGGCCAAGGACGCGGGCATTGCCCTGCTCTCCATCACCCACCGGCCCTCCCTGTGGAA 
ATACCACACACACTTGCTACAGTTCGATGGGGAGGGCGGCTGGAAGTTCGAGAAGCTGGACTCAGCTGCC 
CGCCTGAGCCTGACGGAGGAGAAGCAGCGGCTGGAGCAGCAGCTGGCGGGCATTCCCAAGATGCAGCGGC 
GCCTCCAGGAGCTCTGCCAGATCCTGGGCGAGGCCGTGGCCCCAGCGCATGTGCCGGCACCTAGCCCGCA
AGGCCCTGGTGGCCTCCAGGGTGCCTCCACCTGACACAACCGTCCCCGGCCCCTGCCCCGCCCCCAAGCT 
CGGATCACATGAAGGAGACAGCAGCACCCACCCATGCACGCACCCCGCCCCTGCATGCCTGGCCCCTCCT 
CCTAGAAAACCCTTCCCGCCCTCGGGAAAGTAGATGTGGAGGGTGGCGCCCTGCGTAACCCTCGCCCTGT 
CCCTCCCACTCCCTGGGGGCGCTGTTCCACAGTGACTGGGCCCTGTCCAGGGCAGTGAGTCCTCTACTTT 

GCTCCGCGGAGGAAGCTGGGGTACAAGGGGCCCAGTGCTGGCCACACAGCAGCGCAGCCGAGCCCCAGGA
GCCCGTCAGGCCACAGCCCCTGGCACTGCAGGTGGCCTCCCTCCAGAGACTCGAGTCCCCATGATTCCCT 
CCTCGTCAGTCTCTCAAAGACCCCATGGTCCATCCCCTGAGGGTGGTCAGCCAAGGCTCCCGTTCCGTGG 
GATGCCATAAAAGCCGCCCAGTGGGACCCACAGTCACACAGAGCGCCTCACCTGCATCCTCTCCCCCACA 
AGAGCCCCAAAGATCCCACGGGAGAGGGGAGAGGGACGCACAGCACTGCCTGCCAAGCGAGAATGCAGGC 

CCCGCCCCCTCGGCCCCTCACCACCTCTTTCTACAGCCTAATTTATTGGATTCCCTATTCGTAGCCATCT
CCGTGGCCAATGTGACTACCGTGCCAGCAGCGGGGGCGGCCCAGCCTCTGAGTCCCGTGGGGCCCCGGCT 
CCCACCGGTGCCAAACCCAGCCCCTGCGGCCGTCACCCCGCCAGCCTACACTGCCAGCCGCCACCGGGGC 
ACACGGGCCTCTGCTTGCCAGCCAGGAGTGCGGACACCATGTTCCCAGCTCAGTGCCAAAGAGGGGTCAC 
CAGGGGGAGCTGTCTGCGGAGCCAGCGCCTGCCCGAGAGAGACCCCACCGCCACCGTGTGCCTTTCCCGG 

GCCCTCAGCCCTCGGGCCGGGCACCACCCCCAGTCCCCCCAGTAAA

SEQ ID NO:4 

>gi|7262393|ref|NP_000024.2| ATP-binding cassette, sub-family D (ALD),
member 1; adrenoleukodystrophy protein [Homo sapiens]
MPVLSRPRPWRGNTLKRTAVLLALAAYGAHKVYPLVRQCLAPARGLQAPAGEPTQEASGVAAAKAGMNRV
FLQRLLWLLRLLFPRVLCRETGLLALHSAALVSRTFLSV^ 

ALPATFVNSAIRYLEGQLALSFRSRLVAHAYRLYFSQQTYYRVSNMDGRLRNPDQSLTEDW 
LYSNLTKPLLDVAVTSYTLLRAARSRGAGTAWPSAIAGLVVFLTANVLRAFSPKFGELVAEEARRKGELR
YMHSRWANSEEIAFYGGHEV^LALLQRSYQDLASQI^ 
IITATGYSESDAEAVKKAALEKKEEELVSERTEAFTIARl^^

EMFQVFEDVQRCHFKRPRELEDAQAGSGTIGRSGVRVEGPLKIRGQWDVEQGXICENIPIVTPSGEVW 
ASLNIRVEEGMHLLITGPNGCGKSSLFRILGGLWPTYGGVLYKPPPQRMFYIPQRPYMSVGSLRDQVIYP 
DSVEDMQRKGYSEQDLEAILDVVHLHHILQREGGWEAMC^ 

CTSAVSIDVEGKIFQAAKDAGIALLSITHRPSLWKYHTHLLQFDGEGGWKFEKLDSAARLSLTEEKQRLE 
QQLAGIPKMQRRLQELCQILGEAVAPAHVPAPSPQGPGGLQGAST

SEQ ID NO:5 

PCBP1

GL2_131_2_M13F 

GCTGCGGATCAACATCTCGGAGGGGAATTGTCCGGAGAGAATCATCACATCATGACCGGCCCCACCAATGCCATC
TTTAAGAGCTTTCGCTATGATCATCGACAAGACTGGAGGAAGATATCAACAGCTCCATGACCAACAGTACCGCGG 
CACAGCAGGCCCCACGGGTCACCCTGAGGCTGGATGGTGCCGGCCACCCAGATGCGGCTCCCTGATANGGGAAAG 
GCGGGTGTAAGATCAAAGAGATCANNAGAGAGTACGGGGNGCGCAGGTCCAAGGAATGGCAGGAGGGATATGCAT 
GCCCAAAAACAACCGAGACGGGCCAAANACCATAGCTGGACAGTGCCGAAGGACTGTCACCNGAAGAT

SEQ ID NO:6 

GL2-77-2M13R 

CGCTGCGGCCTCCCGCCCGCTCCCGCTCGCTCCCGCGGCCTCGCTCGCCTCGCGCCGGCAGTTTTGGGCCTACAC 
CTCCCCTCCCCCCGCCAGCCGCCAAAGACTTGACCACGTAACGAGCCCAACTCCCCCGAACGCCGCCGCCGCTCG 
CCATGGATGCCGGTGTGACTGAAAGTGGACTAAATGTGACTCTCACCATTCGGCTTCTTATGCACGGAAAGGAAG
TAGGAAGCATCATTGGGA^^AGGGGAGTCGGTATAAGAGGATCCGCGAGGAG^^GGNGCGCGGATCAACATC
TCGGAGGGGAATTTGTACCGGAGAGAATCATCACTCTGACCGGCNCCCAACAAATGCACAANCNTAAAGGACNTA 
AACGACAAATAGAATCATACGACAAANCTAGAGAGAAGATATCAACAGACTCCAATGAACAAAANAAACCGCAGA 
GACAGACAGGAACCCCNGGAAC 


SEQ ID NO:7 

GL2_209_3_M13R

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 
ACCGATCTTGGCCGCCCGCGCAGATGACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCC 
ACTTGGCGGAGACCTGGGCAGAGGCCGGTTCTCCCAGGGCAATGACCCCAGGCAAATCTGCTTGAGACACACCTA
CGGTGCACAGACTGGCGGGGCAACGGCACAGAGCAGAAGTTGCGGTGAACAGCTGGGCACACAAGGACGA 

SEQ ID NO:8 

GL2_131_3_M13F

CATGGATGCCGGTGTGACTGAAAGTGGACTAAATGTGACTCTCACCATTCGGCTTCTTATGCACGGAAAGGAAGT 
AGGAAGCATCATTGGGAAGAAAGGGGAGTCGGTTAAGAGGATCCGCGAGGAGAGTGGCGCGCGGATCAACATCTC 
GGAGGGGAATTGTCCGGAGAGAATCATCACTCTGACCGGCCCCACCAATGCCATCTTTAAGACTTTCGCTATGAT 
CATCGACAAGCTGGAGGAAGATATCAACAGCTCCATGACCAACAGTACCGCGGCCAGCAGGCCCCCGGTCACCCT 
GAGGCTGGTGGTGCCGGCCACCCAGTGCGGCTCCCTGATTGGGAAAGGCGGGTGTAAGATCAAAGAGATCCGCG 


SEQ ID NO:9 

GL2_205_C05_G3F1

AGAGGCGCAGCGGTCGCAGCGNTGGCTCAGTGTGCTGGGTTCTTGCGGCTTGAATCGAGTAGGCATCCAGAGGTG 
GTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGCACCGATCTTGGCGCCCGCGCAGATGAC 

TGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCCCTTGCGGAGACTGGGAGAGCGTCTCCAG
CATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCACGCCAGCGATGGTGATGGCCCGCTCGGT 
GGAGTTTGGGCAGCATATCCCCCGCAACTGGACCTGCGCCCCCGTACTCTCGCGGATCTCTTTGATCTTACACCC 
GCCTTTCCCAATCAGTGGAGCCGCACTGGGTGGGCGGCAACAACAGCTCAGGGTGACCGGGGGCCTGCTGGCCGC 
GGTACTGTTTGGTCATGGAGCTGTTGATATCTTCCTCCAGCTTGTCGATGATCATAGCGAAAGCCTTAAAGATGG 

CATTGGTGGGGCCGGTCAGAGTGATGATTCTCTCCGGACAATTCCCCTCCGAGATGTTTGATCCGGGCGCCACTC
TCCTCGCGGATCCTCTTAACGAATCCCCTTTCTTCCCAATGATGCTTCTACTTCCTTTCCGTGCATAGGAAGCCG 
AATGGTGAGAGTCACATTTTAGTCCACTTTTCAGTCACACCGGGGATCCATGGNCGANCGGGCGGGGACGGGCGT 
ACGGGGGAATTGGGCTCGTAACAGTTGGNCCAGNTCNTTAGGCGGTGAGAGGGAGGGAAGGGAGGCCAACAATGG 
CGGGGAGAAGAAGGCACGACAACAGAGTGCGGAGGACAACCAACACAACAAGGTAAAAAA 


SEQ ID NO:10

GL2_131_3_M13R

CTTGAATCGAGTAGGCATCCAGAGNGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTG 
CACCGATCTTGGCCGCCCGCGTAGATGACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTC 

CCTTGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGC
ACGCCAGCGATGGTGATGGCCCGCTCGGNGAGAGTTAGGGCACGCAATATCCCCACGAACAAACTGGAACTGACG 
GCCCCCGAAACTCTACGCNAGATACTCNTTAAGAAACTTNACACCCGGACTTATACCCAAATCAGAGAAGCCGCA 
ACATGGGTGGCACCGGAAAACAACAGAACTCCAGGGGATGAACCGGAGGGGACATAGAAAGGGAACAGACGGGGA 
TACCATAGGTCTGGCGGTCCCAATGGGGAGGCCTGCNTATGAACTAAATCCTTACCCTTCCCAANCACATAGGNA 

TCCGGAATGGAAATCAAATAGGCGGAAAAANGTCCCTTTAAAACAGAAAATNGGGCAACTTNGMAAGGGGGGGAC
CACGAGACANAACAAACCGAAGACATACGCAACCAGGGAACAAAANTTACCCCGACACGCAGAGAAGGNTGGACA 
NCACGGCACAGACCACAGTACCCATCNNAGACGGACACCG 

SEQ ID NO:11

GL2_209_1_M13R

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 
ACCGATCTTGGCCGCCCGCGCAGATGACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCC 
CTTGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCA 
CGCCAGCGATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCACACCTGGACCTGCGCCCCCGTA 

CTCTCGCGGATCTCTTATGATCTTAACAACACACGACACNTTTTTCNCANCAAAATCCAGGGAGCCGACAACATG
GAGTGAGCCCGGACACCACACAGACCTCAGGGTGACCGGGGGCCTGCATGGCCGACGGATACNGAANGGTCATGG 
ACGCTGTGATATCTTCCTCCAGCTAGTCGATGATCATAGCGAAAAGCCCCTTTCACAACGAAATGAGGCCATTNG
AGATNGGGGCGGCCCGGGATCCAGGAAGGTTCGCACTAGGAACTNACGTACCATTATCCCCAGGGAGAAAACNCC 
AGAACAATATTACACCCACCCACCGATTACCCCAAGAAAGAAA 


SEQ ID NO:12

GL2_131_2_M13R

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 
GCCGATCTTGGCCGCCCGCGCAGATGACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCC 
CTTGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCA
CGCCAGCGATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCACGAGCANCTGGACCTGCGCCCCCGA 
TACTCTCGACGGATCTCTTTGATCTTACACCCGACCTTTCCCAATCAGGGAAGCCGCACATGGGATGGCACGGCA 
CCACCAGACCTCAGGGATGACCAGGGAGGCCTGCATGGACCGCAGGATACTGATAGGATCATGGAGNCTGATAGA 
TATCTTCCCTCCAGACTATGNTCGATGATCAATAAGGCGGAAAAGGCCATAAANAGNAATGGGCAATTAGGGATC 
GGGAGGACCAGGATCAAGA

SEQ ID NO:13

GL2_81_1_M13F 

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGT 


TTTCGGGGGAAGGAGTAAAGGCCCCCACCGGGACCAATTGGCAGGGTCCGCTGCNACCGATACTTGGCCTGCCGC 
GCAGATGACTGGGGAGCTGGTCGGCATGGGCTGGTACGGAATGGTCATGACTCTCCCTTGCGGAGACTGGGAGAG 
CGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCACGCCAGCGATGGTGATGGC 
CCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCCACCTGGACCTGCGCCCCCGTACTCTCGCGGATCTCTTTGAT 
CTTACACCCGCCTTTCCCAATCAGGGAGCCGCACTGGGTGGCCGGCACCACCAGCCTCAGGGTGACCGGGGGCCT 
GCTGGCCGCGGTACTGTTGGTCATGGGGCTGTTGATATCTTCCTCCAGCTTGTCG

SEQ ID NO:14

GL2_77_1_M13F

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 
ACCGATCTTGGCCGCCCGCGCAGATACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCCC
TTGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACCCTCGGTGACAGACTGCGGCAC 
GCCAGCGATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCCACCTGGACCTGCGCCCCCGTACT 
CTCGCGGATCTCTTTGATCTTACACCCGCCTTTCCCAATCAGGGAGCCGCACTGGGTGGCCGGCACCACCAGCCT 
CAGGGTGACCGGGGGCCTGCTGGCCGCGGTACTGTTGGTCATGGAGCTGTTGATAT 


SEQ ID NO:15

GL2_77_1_M13R

AGCTTGGTACGAGCTCGGATCACTAGTAACGGCCGCCAGTGTGCTGGAATTCGCCCTTCGAGGAGAGTGGCGCGC 
GGATCAACATCTCGGAGGGGAATTGTCCGGAGAGAATCATCACTCTGACCGGCCCCACCAATGCCATCTTTAAGG 

CTTTCGCTATGATCATCGACAAGCTGGAGGAAGATATCAACAGCTCCATGACCAACAGTACCGCGGCCAGCAGGC
CCCCGGTCACCCTGAGGCTGGTGGTGCCGGCCACCCAGTGCGGCTCCCTGATTGGGAAAGGCGGGTGTAAGATCA 
AAGAGATCCGCGAGAGTACGGGGGCGCAGGTCCAGGTGGCGGGGGATATGCTGCCCAACTCCACCGAGCGGGCCA 
TCACCATCGCTGGCGTGCCGCAGTCTGTCACCGAGGGTGTCAAGCAGATTTGCCTGGTCATGCTGGAGACGCTCT 
CCCAGTCTCCGCAAGGGAGAGTCATGACCATTCCGTACCAGCCCATGCCGGCCAGCTCCCCAGTATCTGCGCGGG 

CGGCCAAGATCGGTGCAGCGACGCTGCGGGCTACCCCCATGCCACCCATGACCTGGAGGGACCACCTCTGGATGC
CTACTCGATTCAAG 

SEQ ID NO:16

GL2_77_2_M13F

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC
ACCGATCTTGGCCGCCCGCGCAGATACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCCC 
TTGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCAC 
GCCAGCGATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCCACCTGGACCTGCGCCCCCGTACT 
CTCGCGGATCTCTTTGATCTTACACCCGCCTTTCCCAATCAGGGAGCCGCACTGGGTGGCCGGCACCACCAGCCT 

CAGGGTGACCGGGGGCCTGCTGGCCGCGGTACTGTTGGTCATGGAGCTGTTGATATCTTCCTCCAGCTTGTCGAT
GATCATAGCGAAAGCCTTAAAGATGGCATTGGTGGGGCCGGTCAGAGTGATGATTCTCTCCGGACAATTCCCCTC 
CGAGATGTTGATCCGCGCGCCATCTCCTCGCGGATCCTCTTAAACCGATCCCCTTTCTTCCCAATGATGCTTCTA 
CTTCTTTCCGTGCATAGAA 

SEQ ID NO:17

GL2_77_3_M13F

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTG 
ACCGATCTTGGCCGCCCGCGCAGATACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCCC 
TTGCGGAGACTGGGAGAGCGTCTCCAGCATGACC^GGCA 
GCCAGCGATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCCACCTGGACCTGCGCCCCCGTACT
CTCGCGGATCTCTTTGATCTTACACCCGCCTTTCCCAATCAGGGAGCCGCACTGGGTGGCCGGCACCACCAGCCT
CAGGGTG 

SEQ ID NO:18

GL2_77_4_M13F

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 
ACCGATCTTGGCCGCCCGCGCAGATACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCCC 
TTGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCAC 
GCCAGCGATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCCACCTGGACCTGCGCCCCCGTACT 
CTCGCGGATCTCTTTGATCTTACACCCGCCTTTCCCAATCAGGGAGCCGCACTGGGTGGCCGGCACCACCAGCCT
CAGGGTGACCGGGGGCCTGCTGGCCGCGGTACTGTTGGTCATGGAGCTGTTGATATCTTCCTCCAGCTTGTCGAT 
GATCATAGCGAAAGCCTTAAAGATGGCATTGGTGGGGCCGGTCAGAGTGATGATTCTCTCCGGACAATTCCCCTC 
CGAGATGTTGATCCGCGCGCCACTCTCCTCGCGGATCCTCT 

SEQ ID NO:19

GL2_77_5_M13F 

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 
ACCGATCTTGGCCGCCCGCGCAGATACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCCC 
TTGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCAC 

GCCAGCGATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCCACCTGGACCTGCGCCCCCGTACT
CTCGCGGATCTCTTTGATCTTACACCCGCCTTTCCCAATCAGGGAGCCGCACTGGGTGGCCGGCACCACCAGCCT 
CAGGGTGACCGGGGGCCTGCTGGCCGCGGTACTGTTGGTCATGGAGCTGTTGATATCTTCCTCCAGCTTGTCGAT 
GATCATAGCGAAAGCCTTAAAGATGGCATTGGTGGGGCCGGTCAGAGTGATGATTCTCTCCGGACAATTCCCCTC 
CGAGATGTTGATCCGCGCGCCACTCTCCTCGCGGATCCTCTTAACCGACTCCCCTTTCTTCCCAAATGATGCTTC 

CTACTTCCCTTTTCCGTGCATAAGAAAGCCGAAATGGTGAGAG

SEQ ID NO:20

GL2_131_1_M13F

NNGCTGCGGATCAACATCTCGGAGGGGAATTGTCCGGAGAGAATCATCACATCATGACCGGCCCCACCAATGCCA 
TCTTTAAGAGCTTTCGCTATGATCATCGACAAGACTGGAGGAAGATATCAACAGCTCCATGACCAACAGTACCGC
GGCACAGCAGGCCCCACGGGTCACCCTGAGGCTGGATGGTGCCGGCCACCCAGATGCGGCTCCCTGATANGGGAA 
AGGCGGGTGTAAGATCAAAGAGATCANNAGAGAGTACGGGGNGCGCAGGTCCAAGGAATGGCAGGAGGGATATGC 
ATGCCCAAAAACAACCGAGACGGGCCAAAMACCATAGCTGGACAGTGCCGAAGGACTGTCACCNGAAGAT 

SEQ ID NO:21

GL2_131_1_M13R 

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 

ACCGATCTTGGCCGCCCGCGCAGATGACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCC 

CTTGCGGGAGAACTGGGGAAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGC 

GGCACGCCAGCAATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCCACCTGGACCTGCGCCCCC
GTACTCTCGCGGATCTCTT 

SEQ ID NO:22 

GL2_209_2_M13R

GGCGCCAGTGTGCTGCNATTCGCNTATCTCTCGGCATGGACGAGCTGGTACAAAAAGGAGGAGGGCCCGCAAGTC
GGTGGCAGCGGTGGCTCCAGTGTTGGCTGGGGGGGTTCTGCGGCTTGAATCGGAAGGTTTAAGGGGGCATCCAGA 
GGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGCACCGATCTTGGCGCCGCGCCAGA 
TGACTGGGGGAAGCTGGCCGGCATGGGCTTGGTACGGAA 

SEQ ID NO:23

GL2_209_3_M13F

CGCTGCGGATCAACATCTCGGAGGGGAATTGTCCGGAGAGAATCATCACTCTGACCGGCCCCACCAATGCCATCT 

TTAAGGCTTTCGCTATGATCATCGACAAGCTGGAGGAAGATATCAACAGCTCCATGACCAACAGTACCGCGGCCA 

GCAGGCCCCCGGTCACCCTGAGGCTGGTGGTGCCGGCCACCCAGTGCGGCTCCCTGATTGGGAAAGGCGGGTGTA 

AGATCAAAGAGATCCGCGAGAGTACGGGGGCGCAGGTCCAGGTGGCGGGGGATATGCTGCCCAACTCCACCGAGC

GGGCCATCACCATCGCTGGCGTGCCGCAGTCTGTCACCGAGTGTGTCAAGCAGATTTGCCTGGTCATGCTGGAGA 

CGCTCTCCCAGTCTCCGCAAGGGAGAGTCATGACCATTCCGTACCAGCCCATGCCGGCCAGCTCCCCAGTCATCT 

GCGCGGGCGGCCAAAGATCGGNGCAGCGACGCTGCGGGCTACCCCACAATAGACACACCATGACCTGGAGGGAAC 

AACTACTGGATGCTACATCCGANNTCAAGCCGGAAAAANCCAACAAACTGGGAACCAC^CGATAGACACACGTCG 
CCTTGGCAGGCATCACCACCTATGAA

SEQ ID NO:24

GL2_209_4_M13R 

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 
ACCGATCTTGGCCGCCCGCGCAGATGACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCC
CTTGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCA 
CGCCAGCGATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCCACCTGGACCTGCGCCCCCGTAC 
TCTCGCGGATCTCTTTGATCTTACACCCGCCTTTCCCAATCAGGGAGCCGCACTGGGTGGCCGGCACCACCAGCC 
TCAGGGTGACCGGGGGCCTGCTGGCCGCGGTACTGTTGGTCATGGAGCTGTTGATATCTTCCTCCAGACTTGTCG 

ATGATCATAGCGAAAGCCTTAAAAGATGGCATTGGTGGGGGCCGGTCAGAGTGATGATTCTCTCCGGACAATTCC
CCTTCGAAGATGTGATCCGCGCGCACATCTCCTCGCGGATCTCTTAAACCGAGTCCCNTTTCATCCAAGAATGNA 
CTCCTACTCCCTATACCGGCCATAANAANGCCGAAATGGGTGAAGAANGTCACACTTTAAGGTCCACACTTATCA 
GCACANCCCGGGATCCACAGGGGCAAACGGCCGGGAGCAGGAAGGCCCGAGAGGCCCGGAAAAACCCANACAACA 
GGGTTAAATAAAAATAAAGGGCCGCAAAAAAATCGAAGGAATCGAGAAAAACANGCCGCCAAATCGGCACACTAA 

TGAACGATAACAACTAAAGACAGA

SEQ ID NO:25 

GL2_131_4_M13F 

CGCTGCGGGCTCCCGGCCCGGCTCGCCATGGATGGCCGGTGTGACTGAAAGTGGACTAAATGTGANTCTCACCAT 
TCGGCTTCTTATGCACGGAAAGGAAGTAGGAAGCATCATTGGGAAGAAAGGGGAGTCNGTTAAGANGATCCGTGA
GGAGAGTGGCGCGCGGATCAACTATCTCGTGAGGGAATTGTCNCGGAGAGAANCATCACTCTGACCGGCCCCACC 
AATGANCANCTTTAAGGCTTGCGCATATGATCATCGACAAGCNGGAGGAAGATATCAACAGANTACCATGACCAA 
CAGTACCGCGGGACAGACANGCCCCCGGNCAGACANGAGGCTGGAGGNAGCCGGACCAGC 

SEQ ID NO:26

GL2_205_C04_G3F1

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 
ACCGATCTTGGCCGCCCGCGCAGATGACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCC 
CTTGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCA 

CGCCAGCGATGGTGATGGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGCACTGGACCTGCGCCCCCGTACTC
TCGCGGATCTCTTTGATCTTACACCCGCCTTTCCCAATCAGGGAGCCGCACTGGGTGTGCCGGCACCACCAGCCT 
CAGGGTGACCGGGGGCCTGCTGGCCGCGGTACTGTTGGTCATGGAGCTGTTGATATCTTCCTCCAGCTTGTCGAT 
GATCATAGCGAAAGCCTTAAAGATGGCATTGGTGGGGCCGGTCAGAGTGATGATTCTCTCCGGACAATTCCCCTC 
CGAGATGTTGATCCGCGCGCCACTCTCCTCGCGGATCCTCTTAAACGATCCCCTTTCTTCCCAATGATGCTTCCA 

TACTTCCTTTCCGTGCATAAGAAGCCGAAATGGTGAGAGTCACCATTTAGGTCCACTTTCAGTCAACACCGGATC
CATGGGCGAGCGGGGGCAGGACGTACCGGGGNGAGATTGGGCTCGAACAGTGGGCAAAGACAGGACAGA 

SEQ ID NO:27 

GL2_77_4_M13R

CGCTGCGGCCTCCCGCCCGCTCCCGCTCGCTCCCGCGGCCCTCGCTCGCCTCGCGCCGGCAGTTTTGGGCCTACA
CCTCCCCTCCCCCCGCCAGCCGCCAAAGACTTGACCACGTAACGAGCCCAACTCCCCCGAACGCCGCCCGCCGCT 
CGCCATGGATGCCGGTGTGACTGAAAGTGGACTAAATGTGACTCTCACCATTCGGCTTCTTATGCACGGAAAGGA 
AGTAGGAAGCATCATTGGGAAGAAAGGGGAGTCGGTTAAGAGGATCCGCGAGGAGAGTGGCGCGCGGATCAACAT 
CTCGGAGGGGAATTTGTCCGGAGAGAATCATCACTCTGACCGGCCCCACCAATGCCATCTTTAAGGCTTTCGCTA 

TGATCATCGACAAGCTGGAGGAAGATATCAACAGCTCCATGACCAACAGTACCGCGGCCAGCAGGCCCCCGGTCA
CCCTGAGGCTGGTGGTNGCCGGCCACCCAGTGCGGCTCCCTGATTGGGAAAGGCGGGTGTAAGATCAAAGAGATC 
CGCGAGAGTACGGGGGCGCAGGTCCAGGT 

SEQ ID NO:28 

GL2_77_5_M13R

CGCTGCGGCCTCCCGCCCGCTCCCGCTCGCTCCCGCGGCCCTCGCTCGCCTCGCGCCGGCAGTTTTGGGCCTACA 
CCTCCCCTCCCCCCGCCAGCCGCCAAAGACTTGACCACGTAACGAGCCCAACTCCCCCGAACGCCGCCCGCCGCT 
CGCCATGGATGCCGGTGTGACTGAAAGTGGACTAAATGTGACTCTCACCATTCGGCTTCTTATGCACGGAAAGGA 
AGTAGGAAGCATCATTGGGAAGAAAGGGGAGTCGGTTAAGAGGATCCGCGAGGAGAGTGGCGCGCGGATCAACAT 
CTCGGAGGGGAATTGTCCGGAGAGAATCATCACTCTGACCGGCCCCACCAATGCCATCTTTAAGGCTTTCGCTAT
GATCATCGACAAGCTGGAGGAAGATATCAACAGCTCCATGACCAACAGTACCGCGGCCAGCAGGCCCCCGGTCAC 
CCTGAGGCTGGTGGTGCCGGCCACCCAGTGCGGCTCCCTGATTGGGAAAGGCGGGTGTAAGATCAAAGAGATCCG 
CGAGAGTACGGGGGGCGCAGGTCCAGGG

SEQ ID NO:29

GL2_209_4_M13F

CGCTGCGGCCTCCCGCCCGCCGCTCGCCATGGATGCCGGTGTGACTGAAAGTGGACTAAATGTGACTCTCACCAT 
TCGGCTTCTTATGCACGGAAAGGAAGTAGGAAGCATCATTGGGAAGAAAGGGGAGTCGGTTAAGAGGATCCGCGA 
GGAGAGTGGCGCGCGGATCAACATCTCGGAGGGGAATTGTCCGGAGAGAATCATCACTCTGACCGGCCCCACCAA
TGCCATCTTTAAGGCTTTCGCTATGATCATCGACAAGCTGGAGGAAGATATCAACAGCTCCATGACCAACAGTAC 
CGCGGCCAGCAGGCCCCCGGTCACCCTGAGGCTGGTGGTGCCGGCCACCCAGTGCGGCTCCCTGATTGGGAAAGG 
CGGGTGTAAGATCAAAGAGATCCGCGAGAGTACGGGGGCGCAGGTCCAGGTGGCGGGGGATATGCTGCCAACTCC 
ACCGAGCGGGCCATCACCATCGCTGGCGTGCCGCAGTCTGTTCACCGAAGTGTGTCACAGCNAGATTTGCCTGGT 


CATGCTTGGAAAACGGCTTCTCCCCAANTACCTTCCNGCAAAGGGGAGAAGTCCATTGAACCCANTTCCCCGCGN 
AACCAACAGCCCCCAAATGGGCCCGGGGCCACAGGCTCCCCCCAAGGACANATCNGGAGGCCGGGGGGACGGGCC 
CAACGAATCCGGGGAGGCAAGACGAANACATGCAGGGCATAACCCCCCANGGCACACCCATGAACCTGGGAAGGG 
GACCACCTCTGGGAATGGCNAATCGAGTCAAGCCAGAAAAACCAGCACACG 



SEQ ID NO:30

GL2_205_C06_G3F1

CTTGAATCGAGTAGGCATCCAGAGGTGGTCCCTCCAGGTCATGGGTGGCATGGGGGTAGCCCGCAGCGTCGCTGC 
ACCGATCTTGGCGCCGCGCAGATGACTGGGGAGCTGGCCGGCATGGGCTGGTACGGAATGGTCATGACTCTCCCT 
TGCGGAGACTGGGAGAGCGTCTCCAGCATGACCAGGCAAATCTGCTTGACACACTCGGTGACAGACTGCGGCACG 


CCAGCGATGGTGATGNGCCCGCTCGGTGGAGTTGGGCAGCATATCCCCCGGCACCTGGANCTGCGCCCCCGTACT 
CTCGCGGATCTCTTTGATCTTACACCCGCCTTTCCCAATCAGTGGAGCCGCACTGGGTGTGCCTGGCACCACCAG 
CCTCAGGGTGACCGGGGTGCCTGCTGGCCGCGGTACTGTTTGGTCATGGAGCTGTTGATATCTTCCTCCAGCTTG 
TCGATGATCATAGCGAAAGCCTTAAAGATGGCATTGGTGGGGCCGGTCAGAGTGATGATTCTCTCCGGACAATTC 
CCTCCGAGATGTTGATCCGCGCGCCAATCTCCTCGCGGATCCTCTTAACGAATCCCCTTTCTTCCAAANGATGAT 


TCCTANTTCCTTTACCGTGCATATAGAAAGCCCGAAATGGTTGACGAGTCCACATTTAGTCCACACTTTCAGTCA 
CACCCGGGAATCCAATGGGCGAGCGGAGGGACGGAGACGGGGGGAAGATGGGCGCCGAACGTTGGGCCAAGAATA 
GAGCAAGAGAGNAGAAGGAAAAANGACAAAGAGCACAAACAGAAAAAAGACAAAAAGGNAGCAAGCACACNAAAA 
AA 

SEQ ID NO:31

GL2_209_2_M13F

CGCTGCGGCCTCCCGCCCGCCGCTCGCCATGGATGCCGGTGTGACTGAAAGTGGACTAAATGTGACTCTCACCAT 
TCGGCTTCTTATGCACGGAAAGGAAGTAGGAAGCATCATTGGGAAGAAAGGGGAGTCGGTTAAGAGGATCCGCGA 
GGAGAGTGGCGCGCGGATCAACATCTCGGAGGGGAATTGTCCGGAGAGAATCATCACTCTGACCGGCCCCACCAA 


TGCCATCTTTAAGGCTTTCGCTATGATCATCGACAAGCTGGAGGAAGATATCAACAGCTCCATGACCAACAGTAC 
CGCGGCCAGCAGGCCCCCGGTCACCCTGAGGCTGGTGGTGCCGGACACCCAGTGCGGCTCCCTGATTGGGAAAGG 
CGGGATGTAAGATCAAAG 



SEQ ID NO:32 

>gi|14141164|ref|NM_006196.2| Homo sapiens poly(rC) binding protein 1
(PCBP1), mRNA

GACGCCGCCCGACCCTGCGACTACGCTGCGGACTCCCGCCCGCTCCCGCTCGCTCCCGCGGTCCTCGCTC 
GCCTCGCGCCGGTAGTTTTGGGCCTACACCTCCCCTCCCCCCGCCAGCCGCCAAAGACTTGACCACGTAA 
CGAGCCCAACTCCCCCGAACGCCGCCCGCCGCTCGCC ATG GATGCCGGTGTGACTGAAAGTGGACTAAAT 


GTGACTCTCACCATTCGGCTTCTTATGCACGGAAAGGAAGTAGGAAGCATCATTGGGAAGAAAGGGGAGT 
CGGTTAAGAGGATCCGCGAGGAGAGTGGCGCGCGGATCAACATCTCGGAGGGGAATTGTCCGGAGAGAAT 
CATCACTCTGACCGGCCCCACCAATGCCATCTTTAAGGCTTTCGCTATGATCATCGACAAGCTGGAGGAA 
GATATCAACAGCTCCATGACCAACAGTACCGCGGCCAGCAGGCCCCCGGTCACCCTGAGGCTGGTGGTGC 
CGGCCACCCAGTGCGGCTCCCTGATTGGGAAAGGCGGGTGTAAGATCAAAGAGATCCGCGAGAGTACGGG 

GGCGCAGGTCCAGGTGGCGGGGGATATGCTGCCCAACTCCACCGAGCGGGCCATCACCATCGCTGGCGTG
CCGCAGTCTGTCACCGAGTGTGTCAAGCAGATTTGCCTGGTCATGCTGGAGACGCTCTCCCAGTCTCCGC 
AAGGGAGAGTCATGACCATTCCGTACCAGCCCATGCCGGCCAGCTCCCCAGTCATCTGCGCGGGCGGCCA 
AGATCGGTGCAGCGACGCTGTGGGCTACCCCCATGCCACCCATGACCTGGAGGGACCACCTCTAGATGCC 
TACTCGATTCAAGGACAACACACCATTTCTCCGCTCGATCTGGCCAAGCTGAACCAGGTGGCAAGACAAC 

AGTCTCACTTTGCCATGATGCACGGCGGGACCGGATTCGCCGGAATTGACTCCAGCTCTCCAGAGGTGAA
AGGCTATTGGGCAAGTTTGGATGCATCTACTCAAACCACCCATGAACTCACCATTCCAAATAACTTAATT 
GGCTGCATAATCGGGCGCCAAGGCGCCAACATTAATGAGATCCGCCAGATGTCCGGGGCCCAGATCAAAA 
TTGCCAACCCAGTGGAAGGCTCCTCTGGTAGGCAGGTTACTATCACTGGCTCTGCTGCCAGTATTAGTCT 
GGCCCAGTATCTAATCAATGCCAGGCTTTCCTCTGAGAAGGGCATGGGGTGCAG CTAGA ACAGTGTAGGT 

TCCCTCAATAACCCCTTTCTGCTGTTCTCCCATGATCCAACTGTGTAATTTCTGGTCAGTGATTCCAGGT
TTTAAATAATTTGTAAGTGTTCAGTTTCTACACAACTTTATCATCCGCTAAGAATTTAAAAATCACATTC
TCTGTTC^GCTGTTAATGCTGGGATCGATATT^ 

GGGTTTTTTGGCTCATGAATTTTATTTCTGTTTGTCGATAAGAAATGTAAGAGTGGAATGTTAATAAATT 
TCAGTTTAGTTCTGTAATGTCAAGAATTTAAGAATTAAAAAACGGATTGGTTAAAAAATGCTTCATATTT 
GAAAAAGCTGGGAATTGCTGTCTT

SEQ ID NO:33

>gi|5453854|ref|NP_006187.1| poly(rC)-binding protein 1; heterogenous
nuclear ribonucleoprotein X; alpha-CP1; nucleic acid binding protein sub
2.3; heterogenous nuclear ribonucleoprotein E1 [Homo sapiens]

MDAGVTESGLNVTLTIRLLMHGKEVGSIIGKKGESVKRIREESGARINISEGNCPERIITLTGPTNAIFK
AFAMIIDKLEEDINSSMTNSTAASRPPVTLRLVVPATQCGSLIGKGGCKIKEIRESTGAQVQVAGDMLPN
STERAITIAGVPQSVTECVKQICLVMLETLSQSPQGRVMTIPYQPMPASSPVICAGGQDRCSDAVGYPHA
THDLEGPPLDAYSIQGQHTISPLDLAKLNQVARQQSHFAMMHGGTGFAGIDSSSPEVKGYWASLDASTQT
THELTIPNNLIGCIIGRQGANINEIRQMSGAQIKIANPVEGSSGRQVTITGSAASISLAQYLINARLSSE
KGMGCS



SEQ ID NO:34 

Amino acid transporter (SLC1A5) 

>G3_8_66_PCR_G3F1

CTCGCCCGGTGAAGACGAAGTGCGCTCAAGCGCTCCAGTGCCCAACGCCAGCGCACCCCCGGCCCCCGACACCCC 
CAGTCCCACGCCACGCCGCCACCACGGCACCACTTACAACAACGGTTGCTCAAGCAGCGGCACCTTGTCCCGGGA 
ACCGCAGTAGCCGCCTGCTGCCGCGCCTTGGTCCTCGATGGAGGCCAGCGCCAGGCCCCCGTTGGCGGTGGCTCC 
GCCGCTCGAGCCCCTTGGAGTCTCGAGGAGGATCGGCCACCATGATGGAAGCACCGGGGTTTCTTAGCGCCTGGA 

AGCTGGCTGGGAGCGCTTGGGCTCCTTCCCAGGACCCGACGTTCCTAGGACTGAGTTGAGTAACAGCACCTGGAG
ACTGGAACTTTGGAGGGCTCCTTAGAGTTGTGAGTTCACAGCACTAAGTTCCTTGGCTCTTGGAAGCTGGAGTGT 
TTAAATTCCCCAGGCTGGGCGCTAGGCTTCTC 



SEQ ID NO:35 

>GL3_8.66_A_M13F

CTCGCCCGGGAAGACGAAGGCGCTCAAGCGCTCCAGGCCCAACGCCAGCGCACCCCCGGCCCCCGACACCCCCAG 
TCCCAGCGCCACGCCGGCCACCACGGCCACCACTGTCAGCAGCACAAGCAGGTTGGCTCGAAGGCAGCGGCGCAC 
CTGGTCCCGGGAACCGCAGTAGCCGCCTGCTGCCGCGCCTTGGTCCTCGATGGAGGCCAGCGCCAGGCCCCCGTT 
GGCGGTGGGCTCCGCCGCTGCGAGCCCCTTGGAGTCTCGAGGAGGATCGGCCACCATGATGGGAAGCACCGGGGT 

TTCTTAGCGCCTGGAAGCTGGCTGGGAGCGCTTGGGCTCCTTCCCAGGACCCGACGTTCCTAGGACTGAGTTGAG
TAACAGCACCTGGAGACTGGAACTTTGGAGGGCTCCTTAGAGTTGTGAGTTCACAGCACTGAAGTTCCTTGGCTC 
TTGGAAGCTGGAGTGTTTAAATTCCCCAGGCTGGGCGCTGAGGCTTCTCTGCTCTGCCCCGTGTGCCAGATGTCC 
GAAAGCTGGGAGTTCGGAGCGCCCGGGTTTCCT 



SEQ ID NO:36

>GL3_8.66_A_M13R Direction: anti-sense

CTCCTAGGGCCAAGGAACCCGGGCGCTCCGAACTCCCAGCTTTCGGACATCTGGCACACGGGGCAGAGCAGAGAA 
GCCTCAGCGCCCAGCCTGGGGAATTTAAACACTCCAGCTTCCAAGAGCCAAGGAACTTCAGTGCTGTGAACTCAC 
AACTCTAAGGAGCCCTCCAAAGTTCCAGTCTCCAGGTGCTGTTACTCAACTCAGTCCTAGGAACGTCGGGTCCTG 
GGAAGGAGCCCAAGCGCTCCCAGCCAGCTTCCAGGCGCTAAGAAACCCCGGTGCTTCCCATCATGGTGGCCGATC
CTCCTCGAGACTCCAAGGGGCTCGCAGCGGCGGAGCCCACCGCCAACGGGGGCCTGGCGCTGGCCTCCATCGAGG 
ACCAAGGCGCGGCAGCAGGCGGCTACTGCGGTTCCCGGGACCAGGTGCGCCGCTGCCTTCGAGCCAACCTGCTTG 
TGCTGCTGACAGTGGTGGCCGTGGTGGCCGGCGTGGCGCTGGGACTGGGGGTGTCGGGGGCCGGGGGTGCGCTGG 
CGTTGGGCCTGGAGCGCTTGAGCGCCTTCGTCTTCCCGGGCGAGCCGGAACCCAGCACACTGGAGCCACC 


SEQ ID NO:37

>GL3_8.66_B_M13F Direction: sense

CTATAGGGCGAATGGGCCCTCTAGATGCATGCTCGAGCGGCCGCCAGTGTGATGGATATCTGCAGAATTCGCCCT 
TCTCTCGGCATGGACGAGCTGTACAAGGAGGAGGCCGCCAAGGCCGGTGGCAGCGGTGGCTCCAGTGTGCTGGGT 

TCCGGCTCGCCCGGGAAGACGAAGGCGCTCAAGCGCTCCAGGCCCAACGCCAGCGCACCCCCGGTCCCCGACACC
CCCAGTCCCAGCGCCACGCCGGCCACCACGGCCACCACTGTCAGCAGCACAAGCAGGTTGGCTCGAAGGCAGCGG 
CGCACCTGGTCCCGGGAACCGCAGTAGCCGCCTGCTGCCGCGCCTTGGTCCTCGATGGAGGCCAGCGCCAGGCCC 
CCGTTGGCGGTGGGCTCCGCCGCTGCGAGCCCCTTGGAGTCTCGAGGAGGATCGGTCACCATGATGGGAAGCACC 
GGGGTTTCTTAGCGCCTGGAAGCTGGCTGGGAGCGCTTGGGCTCCTTCCCAGGACCCGACGTTCCTAGGACTGAG 

TTGAGTAACAGCACCTGGAGACTGGAACTTTGGAGGGCTCCTTAGAGTTGCGAGTTCACAGCACTGAAGTTCCTT
GGCTCTTGGAAGCTGGAGTGTTTAAATTCCCCAGGCTGGGCGCTGAGGCTTCTCTGCTCTGCCCCGTGTGCCAGA
TGTCCGAAAGCTGGGAGTTCGGAGCGCCCGGGTTCCTTGGCCCTAGGAG 

SEQ ID NO:38 

>GL3_8.66_B_M13R Direction: anti-sense

CTCCTAGGGCCAAGGAACCCGGGCGCTCCGAACTCCCAGCTTTCGGACATCTGGCACACGGGGCAGAGCAGAGAA 

GCCTCAGCGCCCAGCCTGGGGAATTTAAACACTCCAGCTTCCAAGAGCCAAGGAACTTCAGTGCTGTGAACTCGC 

AACTCTAAGGAGCCCTCCAAAGTTCCAGTCTCCAGGTGCTGTTACTCAACTCAGTCCTAGGAACGTCGGGTCCTG 

GGAAGGAGCCCAAGCGCTCCCAGCCAGCTTCCAGGCGCTAAGAAACCCCGGTGCTTCCCATCATGGTGACCGATC 

CTCCTCGAGACTCCAAGGGGCTCGCAGCGGCGGAGCCCACCGCCAACGGGGGCCTGGCGCTGGCCTCCATCGAGG

ACCAAGGCGCGGCAGCAGGCGGCTACTGCGGTTCCCGGGACCAGGTGCGCCGCTGCCTTCGAGCCAACCTGCTTG 

TGCTGCTGACAGTGGTGGCCGTGGTGGCCGGCGTGGCGCTGGGACTGGGGGTGTCGGGGACCGGGGGTGCGCTGG 

CGTTGGGCCTGGAGCGCTTGAGCGCCTTCGTCTTCCCGGGCGAGCCGGAACCCAGCACACTGGAGCCACCGCTGC 

CACCGGCCTTGGCGGCCTCCTCCTTGTACAGCTCGTCCATGCCGAGAGAAGGGCGAATTCTGCAGATATCCATCA 

CACTGG

SEQ ID NO:39

>GL3_8.66_C_M13F Direction: anti-sense

CTCCTAGGGCCAAGGAACCCGGGCGCTCCGAACTCCCAGCTTTCGGACATCTGGCACACGGGGCAGAGCAGAGAA 
GCCTCAGCGCCCAGCCTGGGGAATTTAAACACTCCAGCTTCCAAGAGCCAAGGAACTTCAGTGCTGTGAACTCAC
AACTCTAAGGAGCCCTCCAAAGTTCCAGTCTCCAGGTGCTGTTACTCAACTCAGTCCTAGGAACGTCGGGTCCTG 
GGAAGGAGCCCAAGCGCTCCCAGCCAGCTTCCAGGCGCTAAGAAACCCCGGTGCTTCCCATCATGGTGGCCGATC 
CTCCTCGAGACTCCAAGGGGCTCGCAGCGGCGGAGCCCACCGCCAACGGGGGCCTGGCGCTGGCCTCCATCGAGG 
ACCAAGGCGCGGCAGCAGGCGGCTACTGCGGTTCCCGGGACCAGGTGCGCCGCTGCCTTCGAGCCAACCTGCTTG 
TGCTGCTGACAGTGGTGGCCGTGGTGGCCGGCGTGGCGCTGGGACTGGGGGTGTCGGGGGCCGGGGGTGCGCTGG
CGTTGGGCCTGGAGCGCTTGAGCGCCTTCGTCTTCCCGGGCGAGCCGGAACCCAGCACACTGGAGCCACCGCTGC 

CACCGGCCTTGGCG 

SEQ ID NO:40 

>GL3_8.66_C_M13R Direction: sense

GATCACTAGTAACGGCCGCCAGTGTGCTGGAATTCGCCCTTCTCTCGGCATGGACGAGCTGTACAAGGAGGAGGC 
CGCCAAGGCCGGTGGCAGCGGTGGCTCCAGTGTGCTGGGTTCCGGCTCGCCCGGGAAGACGAAGGCGCTCAAGCG 
CTCCAGGCCCAACGCCAGCGCACCCCCGGCCCCCGACACCCCCAGTCCCAGCGCCACGCCGGCCACCACGGCCAC 
CACTGTCAGCAGCACAAGCAGGTTGGCTGGAAGGCAGCGGCGCACCTGGTCCCGGGAACCGCAGTAGCCGCCTGC 

TGCCGCGCCTTGGTCCTCGATGGAGGCCAGCGCCAGGCCCCCGTTGGCGGTGGGCTCCGCCGCTGCGAGCCCCTT
GGAGTCTCGAGGAGGATCGGCCACCATGATGGGAAGCACCGGGGTTTCTTAGCGCCTGGAAGCTGGCTGGGAGCG 
CTTGGGCTCCTTCCCAGGACCCGACGTTCCTAGGACTGAGTTGAGTAACAGCACCTGGAGACTGGAACTTTGGAG 
GGCTCCTTAGAGTTGTGAGTTCACAGCACTGAAGTTCCTTGGCTCTTGGAAGCTGGAGTGTTTAAATTCCCCAGG 
CTGGGCGCTGAGGCTTCTCTGCTCTGCCCCGTGTGCCAGATGTCCGAAAGCTGGGAGTTCGGAGCGCCCGGGTTC 

CTTGGCCCTAGGAG

SEQ ID NO:41

>GL3_8.66_D_M13F Direction: N/A

ACTATAGGGCGAATTGGGCCCTCTAGATGCATGCTCGAGCGGCCGCCAGTGTGATGGATATCTGCAGAATTCGCC 
CTTCTCTCGGCATGGACGAGCTGTACAAGGAGGAGGCCGCCAAGGCCGGTGGCAGCGGTGGCTCCAGTGTGCTGG
GTTCCGGCTCGCCCGGGAAGACGAAGGCGCTCAAGCGCTCCAGGCCCAACGCCAGCGCACCCCCGGCCCCCGACA 
CCCCCAGTCCCAGCGCCACGCCGGCCACCACGGCCACCACTGTCAGCAGCACAAGCAGGTTGGCTCGAAGGCAGC 
GGCGCACCTGGTCCCGGGAACCGCAGTAGCCGCCTGCTGCCGCGCCTTGGTCCTCGATGGAGGCCAGCGCCAGGC 
CCCCGTTGGCGGTGGGCTCCGCCGCTGCGAGCCCCTTGGAGTCTCGAGGAGGATCGGCCACCATGATGGGAAGCA 
CCGGGGTTTCTTAGCGCCTGGAAGCTGGCTGGGAGCGCTTGGGCTCCTTCCCAGGACCCGACGTTCCTAGGACTG
AGTTGAGTAACAGCACCTGGAGACTGGAACTTTGGAGGGCTCCTTAGAGTTGTGAGTTCACAGCACTGAAGTTCC 
TTGGCTCTTGGAAGCTGGAGTGTTTAAATTCCCCAGGCTGGGCGCTGAGGCTTCTCTGCTCTGCCCCGTGTGCCA 
GATGTCCGAAAGCTGGGAGTTCGGAGCGCCCGGGTTCCTTGGCCCTAGGAGCCGC 

SEQ ID NO:42

>GL3_8.66_D_M13R Direction: anti-sense

CTCCTAGGGCCAAGGAACCCGGGCGCTCCGAACTCCCAGCTTTCGGACATCTGGCACACGGGGCAGAGCAGAGAA 
GCCTCAGCGCCCAGCCTGGGGAATTTAAACACTCCAGCTTCCAAGAGCCAAGGAACTTCAGTGCTGTGAACTCAC 
AACTCTAAGGAGCCCTCCAAAGTTCCAGTCTCCAGGTGCTGTTACTCAACTCAGTCCTAGGAACGTCGGGTCCTG 
GGAAGGAGCCCAAGCGCTCCCAGCCAGCTTCCAGGCGCTAAGAAACCCCGGTGCTTCCCATCATGGTGGCCGATC
CTCCTCGAGACTCCAAGGGGCTCGCAGCGGCGGAGCCCACCGCCAACGGGGGCCTGGCGCTGGCCTCCATCGAGG
ACCAAGGCGCGGCAGCAGGCGGCTACTGCGGTTCCCGGGACCAGGTGCGCCGCTGCCTTCGAGCCAACCTGCTTG 
TGCTGCTGACAGTGGTGGCCGTGGTGGCCGGCGTGGCGCTGGGACTGGGGGTGTCGGGGGCCGGGGGTGCGCTGG 
CGTTGGGCCTGGAGCGCTTGAGCGCCTTCGTCTTCCCGGGCGAGCCGGAACCCAGCACACTGGAGCCACCGCTGC 
CACCGGCCTTGGCGGCCTCCTCCTTGTACAGCTCGTCCATGCCGAGAGAAGGGCGAATTCTGCAGATATNCATCA
CACTGGCGG 

SEQ ID NO:43

>gi|5032092|ref|NM_005628.1| Homo sapiens solute carrier family 1 (neutral
amino acid transporter), member 5 (SLC1A5), mRNA

GTAACCGCTACTCCCGGACACCAGACCACCGCCTTCCGTACACAGGGGCCCGCATCCCACCCTCCCGGAC 
CTAAGAGCCTGGGTCCCCTGTTTCCGGAGGTCCGCTTCCCGGCCCCCAGATTCTGGCATCCCAGCCCTCA 
GTGTCCAAGACCCAGGCAGCCCGGGTCCCCGCCTCCCGGATCCAGGCGTCCGGGATCTGCGCCACCAGAA 
CCTAGCCTCCTGC^GACCTCCGCCATCTGGGGGCACTC^CCTCCTGGAGCC^AGGGCCCCACGTCCCAC 

CCAGAGAAACTCTCGTATTCCCAGCTCCTAGGGCCAAGGAACCCGGGCGCTCCGAACTCCCAGCTTTCGG
ACATCTGGCACACGGGGCAGAGCAGAGAAGCTCAGCGCCCAGCCTGGGGAATTTAAACACTCCAGCTTCC 
AAGAGCCAAGGAACTTCAGTGCTGTGAACTCACAACTCTAAGGAGCCCTCCAAAGTTCCAGTCTCCAGGT 
GCTGTTACTCAACTCAGTCCTAGGAACGTCGGGTCCTGGGAAGGAGCCCAAGCGCTCCCAGCCAGCTTCC 
AGGCGCTAAGAAACCCCGGTGCTTCCCATC ATGG TGGCCGATCCTCCTCGAGACTCCAAGGGGCTCGCAG 

CGGCGGAGCCCACCGCCAACGGGGGCCTGGCGCTGGCCTCCATCGAGGACCAAGGCGCGGCAGCAGGCGG
CTACTGCGGTTCCCGGGACCAGGTGCGCCGCTGCCTTCGAGCCAACCTGCTTGTGCTGCTGACAGTGGTG 
GCCGTGGTGGCCGGCGTGGCGCTGGGACTGGGGGTGTCGGGGGCCGGGGGTGCGCTGGCGTTGGGCCCGG 
AGCGCTTGAGCGCCTTCGTCTTCCCGGGCGAGCTGCTGCTGCGTCTGCTGCGGATGATCATCTTGCCGCT 
GGTGGTGTGCAGCTTGATCGGCGGCGCCGCCAGCCTGGACCCCGGCGCGCTCGGCCGTCTGGGCGCCTGG 

GCGCTGCTCTTTTTCCTGGTCACCACGCTGCTGGCGTCGGCGCTCGGAGTGGGCTTGGCGCTGGCTCTGC
AGCCGGGCGCCGCCTCCGCCGCCATCAACGCCTCCGTGGGAGCCGCGGGCAGTGCCGAAAATGCCCCCAG 
CAAGGAGGTGCTCGATTCGTTCCTGGATCTTGCGAGAAATATCTTCCCTTCCAACCTGGTGTCAGCAGCC 
TTTCGCTCATACTCTACCACCTATGAAGAGAGGAATATCACCGGAACCAGGGTGAAGGTGCCCGTGGGGC 
AGGAGGTGGAGGGGATGAACATCCTGGGCTTGGTAGTGTTTGCCATCGTCTTTGGTGTGGCGCTGCGGAA 

GCTGGGGCCTGAAGGGGAGCTGCTTATCCGCTTCTTCAACTCCTTCAATGAGGCCACCATGGTTCTGGTC
TCCTGGATCATGTGGTACGCCCCTGTGGGCATCATGTTCCTGGTGGCTGGCAAGATCGTGGAGATGGAGG 
ATGTGGGTTTACTCTTTGCCCGCCTTGGCAAGTACATTCTGTGCTGCCTGCTGGGTCACGCCATCCATGG 
GCTCCTGGTACTGCCCCTCATCTACTTCCTCTTCACCCGCAAAAACCCCTACCGCTTCCTGTGGGGCATC 
GTGACGCCGCTGGCCACTGCCTTTGGGACCTCTTCCAGTTCCGCCACGCTGCCGCTGATGATGAAGTGCG 

TGGAGGAGAATAATGGCGTGGCCAAGCACATCAGCCGTTTCATCCTGCCCATCGGCGCCACCGTCAACAT
GGACGGTGCCGCGCTCTTCCAGTGCGTGGCCGCAGTGTTCATTGCACAGCTCAGCCAGCAGTCCTTGGAC 
TTCGTAAAGATCATCACCATCCTGGTCACGGCCACAGCGTCCAGCGTGGGGGCAGCGGGCATCCCTGCTG 
GAGGTGTCCTCACTCTGGCCATCATCCTCGAAGCAGTCAACCTCCCGGTCGACCATATCTCCTTGATCCT 
GGCTGTGGACTGGCTAGTCGACCGGTCCTGTACCGTCCTCAATGTAGAAGGTGACGCTCTGGGGGCAGGA 

CTCCTCCAAAATTATGTGGACCGTACGGAGTCGAGAAGCACAGAGCCTGAGTTGATACAAGTGAAGAGTG
AGCTGCCCCTGGATCCGCTGCCAGTCCCCACTGAGGAAGGAAACCCCCTCCTCAAACACTATCGGGGGCC 
CGCAGGGGATGCCACGGTCGCCTCTGAGAAGGAATCAGTCAT GTAAA CCCCGGGAGGGACCTTCCCTGCC 
CTGCTGGGGGTGCTCTTTGGACACTGGATTATGAGGAATGGATAAATGGATGAGCTAGGGCTCTGGGGGT 
CTGCCTGCACACTCTGGGGAGCCAGGGGCCCCAGCACCCTCCAGGACAGGAGATCTGGGATGCCTGGCTG 

CTGGAGTACATGTGTTCACAAGGGTTACTCCTCAAAACCCCCAGTTCTCACTCATGTCCCCAACTCAAGG
CTAGAAAACAGCAAGATGGAGAAATAATGTTCTGCTGCGTCCCCACCGTGACCTGCCTGGCCTCCCCTGT 
CTCAGGGAGCAGGTCACAGGTCACCATGGGGAATTCTAGCCCCCACTGGGGGGATGTTACAACACCATGC 
TGGTTATTTTGGCGGCTGTAGTTGTGGGGGGATGTGTGTGTGCACGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTGTGTTCTGTGACCTCCTGTCCCCATGGTACGTCCCACCCTGTCCCCAGATCCCCTATTCCCTC 

CACAATAACAGAAACACTCCCAGGGACTCTGGGGAGAGGCTGAGGACAAATACCTGCTGTCACTCCAGAG
GACATTTTTTTTAGCAATAAAATTGAGTGTCAACTATTTAAAAAAAAAAAAAAAAA 

SEQ ID NO:44

>gi|5032093|ref|NP_005619.1| solute carrier family 1 (neutral amino acid
transporter), member 5; baboon M7 virus receptor; RD114 virus receptor;
neutral amino acid transporter B [Homo sapiens]

MVADPPRDSKGLAAAEPTANGGLALASIEDQGAAAGGYCGSRDQVRRCLRANLLVLLTVVAVVAGVALGL
GVSGAGGALALGPERLSAFVFPGELLLRLLRMIILPLWCSLIGGAASLDPGALGRLGAWALLFF 
LASALGVGLALALQPGAASAAINASVGAAGSAENAPSKEVLD^^ 
60 RNITGTRVKVPVGQEV^GMNILGLWFAIVFGVALRKLGPEGEL^^ 
IMFLVAGKIV^MEDVGLLFARLGKYILCCLLGHAIHGLLVLPLI^ 




SSSSATLPLMMKCVEENNGVAKHISRFILPIGATVNMDGAALFQCVAAVFIAQLSQQSLDFVKIITILVT
ATASSVGAAGIPAGGVLTLAIILEAVNLPVDHISLILAVDWLVDRSCTVLNVEGDALGAGLLQNYVDRTE
SRSTEPELIQVKSELPLDPLPVPTEEGNPLLKHYRGPAGDATVASEKESVM



SEQ ID NO:45

Chromobox homologue 6
>GL3_37C_2_M13F 

ACGCAGGACGCTCTCGCTGAACTTCTTGCTCTTGCCTATAACGCGGTTCCGCGAGGGGACTTTGGGGCGGGCCAG 
CGCCCCGGCCCCCTGCCCGGGCCCCCGCCGCAGCGCCCTTTGTCGATCACCTTCAGGTCAGGATGATGCGGTCCG

CTTGGGCATCCCGCGNCTTACACCTTGCGGTTGATGATGCGCACCGTCTCCGAGAAGGGCGAAATGAGGCGGGCA
GCAGTCCGGGGCTGCCCCCCTGCAGGGTCCGGGCGGGGGCAGGGGACGGCGGGACATACGGTTGGGCAAGNCGGG 
CCGGCGAATGTTCCTTATACTTGAAGCCCGGGTTGCCACGGGATTAGACGCCTTTGGGAGATTGCACGCATATGG 
GGCAAAGGAAGGGCNACCTTGGCCGCCTCGGGCNTTTNGACACGGAAGGNAAAATTGCCACATTCCACTTGAATT 
TGCGCCAGAATANGGGCNCCCACTTNCGTGAGCGACCACTTTGGGCCCCCCACGGGGANGGNACGCCGTTTATCA 

TCAAAGAGGGAAAAGGCNGAAAAAAAAAAGCGTTNTATCAGTAGAGGGGACAGGGGAATTATTAGGGGGGATTAC
CCCCCCCAGTTACAATTTTACCATATTTGAGGAGGAGNCCGCCCANAGTATTAAACATAGGACGTGTTNCAAACG 
GGGACNTTACACCAGCATTANCAATTTCCGCAATTTATTATAGTATGGGGATTCNCAGAACAAANGTGGGCCTTT 
TGTCGAAAAATTTAGGAATAGGTCCCCACGG 



SEQ ID NO:46

>GL3_35L_1_M13R Direction: anti-sense

ACAGCACTTGGGAGCCCGAGGAGAACATCCTGGACTCGCGGCTCATTGCAGCCTTCGAACAAAAGGAGAGGGAGC 
GTGAGCTGTATGGGCCCAAGAAGAGGGGACCCAAACCCAAAACTTTCCTCCTGAAGGCGCGGGCCCAGGCCGAGG 
CCCTNTCCGCATCAGTGATGTGCATTTCTCTGTCAAGCCGAGCGCCAGTGCCTCCTCGCCCAAGCTGCACTCCAG

CGCAGCCGTGCACCGGCTCAAGAAGGACATCCGCCGCTGCCACCGTATGTCCCGCCGTCCCCTGCCCCGCCCGGA
CCCGCAGGGGGGCAGCCCCGGACTGCGCCCGCCCATTTCGCCCTTCTCGGAGACGGTGCGCATCATCAACCGCCA 
AGGTGAAGCCGCGGGAGCCCAAGGCGGAACCGCATCATCCTGAACCTGAAGGTGATCGGAACAAGGGCGCTGGCA 
CGGCAGGGGGCGCCGGGCAAGGGGGCCGGGGACGCTGGCCCGCCCAAAAAAAGGNTCNCCCATCNGCGGGGAAAC 
CCGGCGGTGTAATAGGGGCAAAGAGGCAAGAACAGGTACCAGGCAAAGGAAGCCGGGTCACATGGCAGGTNCGCG 

GNCANAGAAACCCCAGCAACACTTGGGAGGCCACACCGGGTNGCNCAACCGGACCTTTNGGGGGGGCCTCACATC
ACCNATGGTACCAAGGGTCGTCACCAATCGAGCACGGACAAGAAAGGGGGCGAAATTC 



SEQ ID NO:47

>GL3_35L_3_M13R Direction: anti-sense

ACAGCACTTGGGAGCCCGAGGAGAACATCCTGGACTCGCGGCTCATTGCAGCCTTCGAACAAAAGGAGAGGGAGC
GTGAGCTGTATGGGCCCAAGAAGAGGGGACCCAAACCCAAAACTTTCCTCCTGAAGGCGTGGGCCCAGGCCGAGG 
CCCTCCGCATCAGTGATGTGCATTTCTCTGTCAAGCCGAGCGCCAGTGCCTCCTCGCCCAAGCTGCACTCCAGCG 
CAGCCGTGCACCGGCTCAAGAAGGACATCCGCCGCTGCCACCGTATGTCCCGCCGTCCCCTGCCCCGCCCGGACC 
CGCAGGGGGGCAAGCCCCGGAACTGNGCCCGCCCAATTTCGCCCTTCTCGGAGACCGGTGCGCATCATCAACCGC 

AAGTGAAGCCGCGGGAGCCAAGCGGAACCGATCATCCTGAAACCTGAAGGTGATCGGACAAGGGCGCTGGCGGAC
GGGGGCGCCGGGCAGGGGGGCCGGGGGCGCCTGGGCCCCGGCCCCCCAAAANAGTCCCCCACCATCAGGGCGAGG 
GAAAACCCCGGGAGGANAAAGTAGCGGGCAAAANGAAGCCAANAGGAAANGTTCAACAAGGAGAGAAGAACGNGA 
CGACCANGGCGAGATCCCGGGGAAGAGAAACGCNCNAAGGCAAAACAAGATTGGGAGAAGCNACAACNGGTGAGC 
ACAACAGCCGGGGCCACTAGGCGGCGGGNACCTACCCTCCCCTATTGGAACGCAGCNNCGGGCANCAATTGCCGA 

GAAAGAAAGGGGGGCGAATTCTNCGA



SEQ ID NO:48

>GL3_35L_4_M13F Direction: N/A

ACGCAGGACGCTCTCGCTGAACTTCTTGCTCTTGCCTATAACGCGGTTCCGCGAGGGGACTTTGGGGCGGGCCAG 
CGCCCCGGCCCCCTTGCCCGGCGCCCCCGCCGGCCAGCGCCCTTGTCGATCACCGTTCAGGTTCAGGATGATGCG
GTTCCGGCTTTGGGCATCCCGGGGGGCTTCACCTTTGCNGGGTTGACTGATGCGCACCGGACATCTGAGAAAGGG 
CGAAATGGGACGGGACGCAGATCCGGGGCNTGCCCCCCCCTTGCCGGGGGGACTTCCCAGGGGCACGGGGGANGA 
CAAGGGGGGAACGGGGACGGGTGAACCAATTACCGGNGGTGGGCCAGCCGGGCGGGGAATGGGTCCCTTCTTGGA 
AGACCGGGTGCCAA 

SEQ ID NO:49

>GL3_37C_1_M13F Direction: sense 

ACGCAGGACGCTCTCGCTGAACTTCTTGCTCTTGCTTATAACGCGGTTCCGCGAGGGGACTTTGGGGCGGGCCAG 
CGCCCGGCCCCCTTGCCCTGGAGCCCCCGCCGCCAGCAGGCCCATTGTCAGAATTCAACCTTTCAAGGNTNCAAG 
GGATGAATTGCGNGGTCCCGGCTNTATGGGGATTCCNCAGAAGGGACTTTTCAACCTTTTGCCAGGGATATGGAA
CGTGGNATTTGCCGGCCAAACCCCGGGGTNACTCCCCAGAAAGGAAAAAGGGGGGCGGGAAAAAATTTGGGCGCT
GGGGGCCGACGGC 

SEQ ID NO:50

>GL3_35L_1_M13F Direction: N/A

ACGCAGGACGCTCTCGCTGAACTTCTTGCTCTTGCCTTATAACGCGGTTCCGCGAGGGGACTTTGGGGCGGGCCA 
GCGCCCGGGCCCCCTTGCCCCGGGCCCCCGCCCGCCNAGCGGCCCTTTGTCGGATCAACCTTCAGGTCAGGATGA 
TGCGGTTCCGCTTTGGGCTTCCCGGCGGGTTTCACCTTGCGGGTTGATGATGCGGCAACCCGGGTCCTTCCCAGA 
AGAACAGGGGGCGGACAATTGGNGCGCAGGGGC 

SEQ ID NO:51

>GL3_35L_3_M13F Direction: N/A 

ACGCAGGACGCTCTCGCTGAACTTCTTGCTCTTGCCTATAACGCGGTTCCGCGAGGGGACTTTGGGGCGGGCCAG 
CGCCCCGGCCCCCCTGCCCGGGCCCCCGCCGCCAGCGCCCTTGTCGATCACCTTCAGGTNCAGGATGATGCGGTT 
CCGCTTGGGCTCCCGCGGCTTCACCTTGCGGTTGATGATGCGCACCGTCTCCGAGAAGGGCGAAATGGGCGGGCG
CAGTCCGGGGCTGCCCCCCCTTGCGGGATCCGGGCGGGGCAGGGGACCGGCGGGCACCATACGGTGGCAGCGGCG 
GGAATGTCCTTCTTGAAGCCGGTGCACGGCTTGCGCTTGAAGTTGCAGCTTGGGCCAAGG 



SEQ ID NO:52

>GL3_35L_4_M13R Direction: anti-sense

ACAGCACTTGGGAGCCCGAGGAGAACATCCTGGACTCGCGGCTCATTGCAGCCTTCGAACAAAAGGAGAGGGAGC
GTGAGCTGTATGGGCCCAAGAAGAGGGGACCCAAACCCAAAACTTTCCTCCTGAAGGCGCGGGCCCAGGCCGAGG
CCCTCCGCATCAGTGATGTGCATTTCTCTGTCAAGCCGAGCGCCAGTGCCTCCTCGCCCAAGCTGCACTCCAGCG
CAGCCGTGCACCGGCTCAAGAAGGACATCCGCCGCTGCCACCGTATGTCCCGCCGTCCCCTGCCCCGCCCGGACC
CGCAGGGGGGCAGCCCCGGACTGCGCCCGCCCATTTCGCCCTTCTCAGAGACGGTGCGCATCATC

SEQ ID NO:53

>GL3_35L_PCR_G3F1 Direction: N/A

GACGCAGGACGCTCTCGCTGAACTTCTTGGCTCTTGCCTATAACGCGGTTCCGCGAGGGGACTTTGGGGCGGGCC
AGCGCCCCGGCCCCCTGCCCGGGCCCCCGCGCGGCCCTTTATACCTTCAGTTCAGATGATGCGGTTCCGCTTGGG
CTCCCGCGGCTTCACCTTGCGGTTGATGATGCGCACCGTCTCCGAGAAGGGCGAAATGGGCGGGCGCAGTCCGGG
GCTGCCCCCCTGCGGGTCCGGGCGGGGCAGGGGACGGCGGGACATACGGTGGCAGCGGCGGATGTCCTTCTTGAG
CCGGTGCACGGCTGCGCTGGAGTGCAGCTTGGGCGAGGAGGCACTGGCGCTCGGCTTGACAGAGAAATGCACATC
ACTGATGCGGAGGGCCTCGGCCTGGGCCCGCGCCTTCAGGAGGAAAGTTTTGGGTTTGGGTCCCCTCTTCTTGGG
CCCATACAGCTCACGCTCCCTCTCCTTTTGTTCGAAGGCTGCAATGAGCCGCGAGTCCAGGATGTTCTCCTCGGG
CTCCCAAGTGCTGT

SEQ ID NO:54

>GL3_37C_1_M13R Direction: anti-sense

ACAGCACTTGGGAGCCCGAGGAGAACATCCTGGACTCGCGGCTCATTGCAGCCTTCGAACAAAAGGAGAGGGAGC
GTGAGCTGTATGGGCCCAAGAAGAGGGGACCCAAACCCAAAACTTTCCTCCTGAAGGCGCGGGCCCAGGCCGAGG
CCCTCCGCATCAGTGATGTGCATTTCTCTGTCAAGCCGAGCGCCAGTGCCTCCTCGCCCAAGCTGCACTCCAGCG
CAGCCGTGCACCGGCTCAAGAAGGACATCCGCCGCTGCCACCGTATGTCCCGCCGTCCCCTGCCCCGCCCGGACC
CGCAGGGGGGCAGCCCCGGACTGCGCCCGCCCATTTCGCCCTTCTCGGAGACGGTGCGCATCATCAACCGCAAGG
TGAAGCCGCGGGAGCCCAAGCGGAACCGCATCATCCTGAACCTGAAGGTGATCGACAAGGGCGCTGGCGGCGGGA
GGCGCCGGGCAGGGGGCCGGGNCGCTGGCCCGCCCCCAAAGTCCCCTCGCGGAAACCCGCGGTAAATAGGCAAAG
AAGCAAGGAAGGTCAGGCGAGAGGCGGTCCCTGCGTCCGNAGAAACCCAGCACACCTGGACGGCCACCGATGCCA
CNGGCCTGGGGGACTCCTCCTGGTACACGGTCGGCACATGCCGG

SEQ ID NO:55

>GL3_37C_2_M13R Direction: anti-sense

ACAGCACTTGGGAGCCCGAGGAGAACATCCTGGACTCGCGGCTCATTGCAGCCTTCGAACAAAAGGAGAGGGAGC
GTGAGCTGTATGGGCCCAAGAAGAGGGGACCCAAACCCAAAACTTTCCTCCTGAAGGCGCGGGCCCAGGCCGAGG
CCCTCCGCATCAGTGATGTGCATTTCTCTGTCAAGCCGAGCGCCAGTGCCTCCTCGCCCAAGCTGCACTCCAGCG
CAGCCGTGCACCGGCTCAAGAAGGACATCCGCCGCTGCCACCGTATGTCCCGCCGTCCCCTGCCCCGCCCGGACC
CGCAGGGGGGCAGCCCCGGACTGCGCCCGCCCATTATCAGCCCTTCTCGGAGACGGTGCGCATCATCAACCGCAA
GGTGAAGCCGCGGGAGCCCAAGCGGAANCGCATCATCCATGAACCTGAA

SEQ ID NO:56

aScAGGAC^^ 

otcSgagc^^ 

tgcacatcactcatgcggagggcctcggcctgggcccgcgccttcaggaggaa^ 
ttcttgggcccatacagctcacgctccctctccttttgttcgaaggctgcaatgag 

SEQ ID NO:57

>gi|10140848|ref|NM_014292.1| Homo sapiens chromobox homolog 6 (CBX6), mRNA
a?tItgggctgtgggtgcc5ctgagcaagatggagctgtctgcagtc^gcgagcgggtcttcgcgg^ 

atcca?cm^ 

aagtacagxa^ 

agagggagcS 

cccaagSgcactccagcgcagccgtgcaccggctcaagaaggacatccgccgctgccaccgtatgtccc 

GGAGACGGTGCGCATCATCAACCGCAAGGTGAAGCC^ 

CTGAAGGTGATCGACAAGGGCGCTGGCGGCGGGGGCGCCGGGCAGGGGGCCGGGGCGCTGGCCCGCCCCA 
AAGTCCCCTCGCGGAACCGCGTTATAGGCAAGAGCAAGAAGTTCAGCGAGAGCGTCCTGCGTACACAGAT 

Sgccacatgaagttcggcgcctttgcgctgtacaagcctccgcccgcccccctggtagccccgtccccc 
ggcSSgSgaggcctcagccccgggccctgggctacttctggccgcccccgcc^^ 

gcag^S^ 

gct?ctccccgagaccgtgagcccatccgcccccagctggcgcgagccggaggtgctc 
2Scccgag?SgcaS^^ 

ccacggcccctgagcccgccggtgcctcctccgagcccgaggctggggactggcgccccgagatgtcacc 

ctgctccaatgtggtcgtcaccgatgtcaccagcaacctcctgacggtcacaatcaaggaattctgcaac 

cctgaggatttcgagaaggtggctgctggggtagcaggcgccgctgggggcggtggcagcattggggcga 

gcaagtgagggggctccaccaaggaggggggcttgggggggccctcctgcccgaagtcatactcotgctc 

ccaccc^cccttgcccccagccctctctccctgtgctttgcttgtctcaaatggctcggtgttgaccca 

gggatggggctgggtagttggggtcccagaaagccgggggtaggggccaccctggaatggggcaggggaa 

gggcacaccccctgcccatgcatggtagcccactgggtggtttctggaaagccctagaaactagggttcc 

tctgccccttccacatcccacctgtctctctagcttgcttcctgctctcctgtgcggcgtctgatttctc 

ggtgctaacctggcagctgtggggcccttaggagccccccaccgagggtggacacagtccctttccttcc 

tgcagatgcctaggcaggaggagggcttcctgcctgtttggcaaagtcccaggcagaggccaaggatgag 

gcctgactcggctcctccctccacatcagccagggcatcagaagttgggccagggcggggtcttccctgc 

tcgattttggacgaggcctaagtagaccccctatgccctgccccagccctggctctttcctaaccc^ 

aacggtgggaggaactggcagagggtgcgcctggcc^cagcctccccgcatctaaaggccccttcag^ 

togaccaaaggtgctacgagaacctgccgtggaaacttccagttgtgcgtctgccccactcgctgtgttt 

gaacaggggcattgaaggcctggactgcctctcgcctcagtaggcctggggaccaggcttgggtctggag 

gottgctgtggaagtcaccaggcctcccctcctggcccaggtgtgctgggggcaccgtgccccc^ 

cctgccctcctcagggtggtcagcccaacctgtcggaccttcacttcacatcatggtggggaccgagata 

raGAGGGAGACCCCATTCCAAGCTCCCTCTTCCTCCGGGTGTTTGGGGAGGATGCTGAAGAATCCATTCC 

cgagggcctcccggcttgtcccagcccctcttttgcttctgaccacggaggctttctcacagcccagcct 
gcctgaagcaaaggaggctcccgtgtcctgggcagcttctgtttccctctgctgcctgggagctgaggca 

cccotg^cag^cagaggccacagccccagccttaggccaggccctgggaggg^ 
agaccagagggtctgtgttctccaggagaatgagggtgttggtcccagaattgggaccggggccccgctg 
gccagccctgggccacttcccgggtctccattgtgcgtgggtggcgtgttccaggcgtggctggagctgg 
ottcctggctgtgctgccatgggcccctccctcagaagcacgttggcaggaggccgatcagaaccctagc 

rCCTTTGGTCCTAAGAATGGGAGGCTGCCTTCCTTCCCAATCTCCCTGCCAGGGCCCACAGCGTGGCCCT 

?gccctcccctccccgggatgtagaacggggaccctcgcagggttggggcgggggctgatactcctcggc 

rCCTCCCTACCCTGCCCTGTGTGTTGGCOTTGTGGCCGTCCAAGTGCCAATTGGCTTTTCGCCCAAATAA 
rGGCTGGTATTTCTCCTCTGTCCTTGGAGGTGATTTCCCCCTGACCCCCTCCCCCAGGTGAGTGACCACC 
TGGGTGCCAGTTACAGGTGTTTCCAGAGACCATAGAAATGTGTTTTCCTGAGAGTTCGTGTC^TTCGTGA 
CTTTTTTGTAAAGAAGTTGTGTTTTCAGAGGTGATTTTATGACAGGAAAGTGAAAGAATTAGTTTTGCAA 

S^caaaaacaaaaaaagaggaaaaaaa^ 
gtggcgggggagaaagagctatttaagaaaaaatagtaacgcagt
CAGGATGGGGCGGAGGCCTGGGCCCAGCTGGCAGGTCCCCTGGCATCGCAGGCACTGTGGAGAGGGCCTG
GACCCAGATCTCCACACCCGTGCTTGCTCAAAGGGAAGGACAACAGCGGGCCCCCGGGAGCTAACCCAAG 
CTGCAGGTCCCGGCAAGCTGAGGTTTGGGAGGGTGGGGGTTGTCACTGGTGATTTTCTCCAGGGGGCTGG 
TGAGTGGGCAGTTTGGTTTCTTGCCCCCTTCTGTTCCTTTCCCAGTTGTTGGGCCATCTGGTCCCCACCA 
CCGCCACCCTATGGGGGAGACCTCCCTCCCCACGGGTCACCCTAAAGCCCACAACCTCTCTGAGCCTCCC
TGGCCTGAAAGGGGATGCAGGCTTCAGGAGGCAAGAAGCTGGGCCCCTGGGGGTGGCTGGGGAGAGGGAA 
TGCATTTCCCTTGCCACAGGTGGTCTGCTTCTGCTGGCCTGAGCTCCAAGTGGAGCAGCCCGGGCCAGCC 
TTGGTGCATGAAGAGGCACCAGGCACACCGCCTTGAGGTGGGCAGTGCCCATGGGGGCCCGAGTGGATGG 
GACCGAGGGTGAGTGGAGCCTCCTTCCTCCCCTCTCTAGTACCCCCGCCTCCCACACACTTGCACGGATC 

GGCCTCCCTTGGGAGATCAGCCTCCATGGGCCCCTCGTCCACCCTTGCTGCTTTCCATTTGCCTAATTAC
CAAGCAGAAGTTGCAATCTGGTTTGCTTTATTTTTGTATGTGAAATAACCCCCAAAGCCCAATCTCCTCC 
TACGTTCAATATTGGTTGGGGCATCCGTCATCTCCCCTTAAGTGCGCCCCCTCCCCACCCAAGTATCATA 
GGAAACCGGTGAGGTCTGGTGTCTCTGGTTTGAGACGGTAAGTTGGGACCCATCCCTGTCTGGGTGCCCA 
CTCTGACCTTTAGTTTGCCCTTCTGTGAAATGGGTGTATTGGGTAGCAAGCCCTCTTCAGAAAGCGCTGC 

TGGTGTCAGAGCAGCTGCCCAGTACCAGGTGGGGGGTCAAGGTTGCTGGTACTGGGGCCCCCAGCTGCCC
ACAACCCCTCTTTGTTCTCACCCTGCAAAGGGGTCT^AGGTCAAAATGAGCCTCATCCTTCCTATGATCTG 
GGAAGAGGTGATGATCAAGTCCCCAACTTCAGTGTGAGGTGGACAGAGTTGGGGGGATGGCCCCTTTTTG 
AAGAGGTGAAAATGGTTTTGGAGAAGCGCAGCTGCTTCACTGGGGGAATGCGGCAGGGACTGGGGCCCAG 
GATGCTTTGGCCTATGGGGAAAAGCCCTTTAAAAGG^ 

CCTCGAGCCCCTGAGCCAGGAGGTCCTGGAGGAGAGCCAGGCCGGTGGGCCCGCCCAAGGCTGGAGGGTC
AGCCCCAACAGGGAGCTGGGTTGGCCAGGGGGCTGGACTGCTACCAGCCTCTCTGGCCTATGGGGACCCA 
AGAGGACACATCCCCCTTTTGCCCACTCTTCTGTGTCATTTTGTTGTTTTGGTTTGTGGTGGTTTTTCTT 
TTTTCTTTTGTTTTTCTTTTTTCTTTCTTTCTTTTTTTTTTTTT1U W ITTTTTTTTTTTGCACTTCGCCCA 
CACAGGACAGTGGAGCCCCACCTGGTCAGTTCCACTTCCGGGCTCCCATGCACTTGCCCAAGGCGGCCTC 

TTTGGGACGGGGATGGTTTGAGGAAACACTTTTAAAGAAAAAAGGAAGACATTGAAAGGTTTTAGTTTCT
TCCCTATCTGCATGTCCTCTCATATAGAAAGCCCAGAATTAGGGGCTAGAACTCCAGGAGAGGGTCTCCC 
CGACTCATCTCTTGCTGACGGTCACCAGGATGCAGAAATAGGGAGATGGTTAGTGGGGGCCAAAGATGCC 
CCCTCCCAGGCCTTCGTGGTTCCCTCCTCCGCCCCCTGCAATCTTTGGAGGAGTCAGTGCCTCACTCCAG 
CAGTGAGTGCCTACTGTATGCAGGTAGTCAGCCAGGCAAAGAGAGACTAACGGTCTCATGGGGGAACCCT 

CTTGCGGGAGGCCGGGTAGCTGGAGCGAAGCGTTCCGGCTGCCCTTGCTGCTGGGTGGAGTGGAGAGGGA
GACTTCTTTTTGTTGGTTTTAATTTAAAAACACAAAGGCCTAAAGAAATACGTATCTTATAATTTTTTTA 
ATTTTTGAGACGTTCATTTAATGAATTGTGCACGAATGAATTCTATATATATAAAATATACATATATAGC 
TCTATATTTGGGGAGGGGCACTGTCTCTTTTTTCTCTCATTTTTAAAATGAAGTGTTGTTGCCTTTGTAT 
GTGGTTCAACCATCCAGCTCCCAGCTGGCTAAACTTTGCCTCCAGTGGTCAAAGATGGGAAAAGAGTGGG 

GTTGGCAGGAGATGGAAAACGGAGGTGCCGCCCCAGCATGGGGGGCAGGTCCCCCAGTCCACCCTGCCCC
TCCCCCTGTGGAGAAGACGCTTAGTTGGGGGTGTGGGTTTGGGCTCCATTCTGGATTCGGCGGTTCCGGG 
GGAGGGGTGGGTCTGTGCCGATTACTCTGTCTTGTACGTTTGTTCTGCTGCTCTTCAATATTGTATCAAC 
GCCAGGAAAGGGGGGTGA^lAAGCCTCTTTTACCCCCCAAATAAATTGTCACATTCCGAAGCTGA 

SEQ ID NO:58

>gi|10140849|ref|NP_055107.1| chromobox homolog 6 [Homo sapiens]
MELSAVGERVFAAESIIKRRIRKGRIEYLVKWKGWAIKYSTWEPEENILDSRLIAAFEQKERERELYGPK 
KRGPKPKTFLLKARAQAEALRISDVHFSVKPSASASSPKLHSSAAVHRLKKDIRRCHRMSRRPLPRPDPQ
GGSPGLRPPISPFSETWIINRKVKPREPKRNRIILNLK^^ 

KSKKFSESVLRTQIRHMKFGAFALYKPPPAPLVAPSPGKAEASAPGPGLLLAAPAAPYDARSSGSSGCPS
PTPQSSDPDDTPPKLLPETVSPSAPSWREPEVLDLSLPPESAATSKRAPPEVTAAAGPAPPTAPEPAGAS 
SEPEAGDWRPEMSPCSNVVVTDVTSNLLTVT^

SEQ ID NO:59
Cytochrome C Oxidase subunit 1

>GL2_226_2_1_M13R Direction: sense 

ACTTTCTCGGCCTATCCGGAATGCCCCGACGTTACTCGGACTACCCCGATGCATACACCACATGAAACATCCTAT 
CATCTGTAGGCTCATTCATTTCTCTAACAGCAG 

SEQ ID NO:60

>GL2_226_2_2_M13F Direction: anti-sense

CTGCTGTTAGAGAAATGAATGAGCCTACAGATGATAGGATGTTTCATGTGGTGTATGCATCGGGGTAGTCCGAGT 
AACGTCGGGGCATTCCGGATAGGCCGAGAAAGT

SEQ ID NO:61

>GL2_226_2_2_M13R Direction: sense

ACTTTCTCGGCTATCCGGAATGCCCCGACGTTACTCGGACTAACCCGATGCATACACCACATGAAACATCCTATN 
ATACTGTAGGCTCCATTCATTTCTCTNACAGCAGA 

SEQ ID NO:62 

>GL2_226_2_4_M13F Direction: anti-sense 

CTGCTGTTAGAAGAAATGAATGAGCCTTACAGATGATAGGATGTTTCATGTGGTGTATGCATCGGGGTAGTCCGA 
GTAACGTCGGGGCATTCCGGATAGGCCGAGAAAGT 
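
The "Direction: sense" and "Direction: anti-sense" labels on paired reads can be checked computationally: over their overlap, an anti-sense read should equal the reverse complement of the sense read. The following is a minimal sketch of such a check, not a method described in this listing. The function name `reverse_complement` and the two constant names are hypothetical; the fragments are the last 33 nucleotides of SEQ ID NO:59 and the first 33 nucleotides of SEQ ID NO:60.

```python
# Complement map for unambiguous bases plus N (unknown base).
# Other IUPAC ambiguity codes are deliberately not handled in this sketch.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# Fragments copied from the listing (hypothetical constant names).
SEQ59_TAIL = "CATCTGTAGGCTCATTCATTTCTCTAACAGCAG"  # end of SEQ ID NO:59 (sense)
SEQ60_HEAD = "CTGCTGTTAGAGAAATGAATGAGCCTACAGATG"  # start of SEQ ID NO:60 (anti-sense)

if __name__ == "__main__":
    # Reports whether the anti-sense fragment mirrors the sense fragment.
    print(reverse_complement(SEQ59_TAIL) == SEQ60_HEAD)
```

The same comparison can be applied to any other sense/anti-sense pair, provided the reads are first trimmed to their common region.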

SEQ ID NO:63 

5905-7446 
/gene="COX1"
/codon_start=1
/transl_table=2

/product="cytochrome c oxidase subunit I"

5881 agccatttta cctcaccccc actgatgttc gccgaccgtt gactattctc tacaaaccac
5941 aaagacattg gaacactata cctattattc ggcgcatgag ctggagtcct aggcacagct 
6001 ctaagcctcc ttattcgagc cgagctgggc cagccaggca accttctagg taacgaccac
6061 atctacaacg ttatcgtcac agcccatgca tttgtaataa tcttcttcat agtaataccc 
6121 atcataatcg gaggctttgg caactgacta gttcccctaa taatcggtgc ccccgatatg 
6181 gcgtttcccc gcataaacaa cataagcttc tgactcttac ctccctctct cctactcctg 
6241 ctcgcatctg ctatagtgga ggccggagca ggaacaggtt gaacagtcta ccctccctta 
6301 gcagggaact actcccaccc tggagcctcc gtagacctaa ccatcttctc cttacaccta
6361 gcaggtgtct cctctatctt aggggccatc aatttcatca caacaattat caatataaaa 
6421 ccccctgcca taacccaata ccaaacgccc ctcttcgtct gatccgtcct aatcacagca 
6481 gtcctacttc tcctatctct cccagtccta gctgctggca tcactatact actaacagac 
6541 cgcaacctca acaccacctt cttcgacccc gccggaggag gagaccccat tctataccaa 
6601 cacctattct gatttttcgg tcaccctgaa gtttatattc ttatcctacc aggcttcgga 
6661 ataatctccc atattgtaac ttactactcc ggaaaaaaag aaccatttgg atacataggt 
6721 atggtctgag ctatgatatc aattggcttc ctagggttta tcgtgtgagc acaccatata 
6781 tttacagtag gaatagacgt agacacacga gcatatttca cctccgctac cataatcatc
6841 gctatcccca ccggcgtcaa agtatttagc tgactcgcca cactccacgg aagcaatatg 
6901 aaatgatctg ctgcagtgct ctgagcccta ggattcatct ttcttttcac cgtaggtggc 
6961 ctgactggca ttgtattagc aaactcatca ctagacatcg tactacacga cacgtactac 
7021 gttgtagctc acttccacta tgtcctatca ataggagctg tatttgccat cataggaggc 
7081 ttcattcact gatttcccct attctcaggc tacaccctag accaaaccta cgccaaaatc 
7141 catttcacta tcatattcat cggcgtaaat ctaactttct tcccacaaca ctttctcggc 
7201 ctatccggaa tgccccgacg ttactcggac taccccgatg catacaccac atgaaacatc
7261 ctatcatctg taggctcatt catttctcta acagcagtaa tattaataat tttcatgatt 
7321 tgagaagcct tcgcttcgaa gcgaaaagtc ctaatagtag aagaaccctc cataaacctg 
7381 gagtgactat atggatgccc cccaccctac cacacattcg aagaacccgt atacataaaa 
7441 tctagacaaa 

SEQ ID NO:64 

>gi|17981855|ref|NP_536845.1| cytochrome c oxidase subunit I [Homo sapiens]

MFADRWLFSTNHKDIGTLYLLFGAWAGVLGTALSLL^ 

MPIMIGGFGIWIiVPLMIGAPDMAFPRMNl^ 

ASVDLTIFSLHIAGVSSILGAINFITTIINMKP 

TDRl^NTTFFDPAGGGDPILYQHLFWFFGHPEVYILILPGFGM 

GFLGFIWAHHMFTVGMDVDTRAYFTSATMIIAIPTGVKV 

GGIiTGIVLANSSLDIVLHDTYYWAHFHYVLSM 

VNLTFFPQHFLGLSGMPRRYSDYPDAYTTWNILSSVGSFISLT 

NLEWLYGCPPPYHTFEEPVYMKS 

SEQ ID NO:65

Delta-5 Fatty Acid Desaturase (FADS1; FADSD5)
>GL1_10_3_G3F1 Direction: sense

GGCCTGAACTGNACTGAGCAGCACCGCACANAGGAGGAAGGGCAAAAAGGACGTCCCAAAGACCCAAAGGGTGAG 
CCAGNCTGNACCATCCAGCAGCAAGATGTGCAGACAGGTACAGNAGGAAGAANACATGGTTGGCCTTCATGAGCC 
CCATCCGCTCCACTGTGGNCCGCANACTCCCGGAACTCATCTGTCAGCTCTTTATTCTTGGTGGGCTCAAAGCTG
GGCTGCTCTGGAGACAGTTCTCCAATCAGGAGAGAGTTCATATACTTCTTCACAAGGCCCTTGTGGATGTGGAAG
GCCACAAAGGGATCCGTGGNATCCTGNCCGGNGTAGTGGCTGATGAITCCGGGAGCCCCCTGNATGCCGGCGGAGT 
GAACTCGCTGATGTTGTACACCTTACGGTCGATCACTANCCACCGCTCCT 

SEQ ID NO:66 

>GL1_10_3_G3R1 Direction: anti-sense

CAGACGCTCAAGGATGCGAGGAGCGGTGGCTAGTGATCGACCGNTAAGGTGTACAACATCAGCGAGTTCACCCGC 
CGGCATCCAGGGGGCTCCCGGGTCATCAGCCACTACGCCGGGNAGGATGCCACGGATCCCTTTGTGGCCTTCCAC 
ATCAACAAGGGCCTTGCTGAAGAAGTATATGAACTCTCTCCTGATTGGAGAACTGTCTCCAGAGCAGCCCAGCTT
TGAGCCCACCAAGAATAAAGAGCTGACAGATGAGTTCCGGGAGCTGCGGGCCACAGTGGAGCGGATGGGGCTCAT 
GAAGGCCAACCATGTCTTCTTCCTGCTGTACCTGCTGCACATCTTGCTGCTGGATGGTGCAGCCTGGCTCACCCT 
TTGGGTCTTTGGGACGTCCTTTTTGCCCTTCCTCCTCTGTGCGGTGCTGCTCAGTGCAGTTCAGGCC 

SEQ ID NO:67

>delta5-Fatty acid desaturase

ATGGCCCCCGACCCGTTGGCCGCCGAGACCGCGGCTCAGGGACTTACCCC 

GCGCTACTTCACCTGGGACGAGGTGGCCCAGCGCTCAGGGTGCGAGGAGC 

GGTGGCTAGTGATCGACCGTAAGGTGTACAACATCAGCGAGTTCACCCGC 
CGGCATCCAGGGGGCTCCCGGGTCATCAGCCACTACGCCGGGCAGGATGC

CACGGATCCCTTTGTGGCCTTCCACATCAACAAGGGCCTTGTGAAGAAGT 

ATATGAACTCTCTCCTGATTGGAGAACTGTCTCCAGAGCAGCCCAGCTTT 

GAGCCCACCAAGAATAAAGAGCTGACAGATGAGTTCCGGGAGCTGCGGGC 

CACAGTGGAGCGGATGGGGCTCATGAAGGCCAACCATGTCTTCTTCCTGC 
TGTACCTGCTGCACATCTTGCTGCTGGATGGTGCAGCCTGGCTCACCCTT

TGGGTCTTTGGGACGTCCTTTTTGCCCTTCCTCCTCTGTGCGGTGCTGCT 

CAGTGCAGTTCAGGCCCAGGCTGGCTGGCTGCAGCATGACTTTGGACACC 

TGTCGGTCTTCAGCACCTCAAAGTGGAACCATCTGCTACACCATTTTGTG 

ATTGGCCACCTGAAGGGGGCCCCCGCCAGTTGGTGGAACCACATGCACTT 
CCAGCACCATGCCAAGCCCAACTGCTTCCGCAAAGACCCAGACATCAACA

TGCATCCCTTCTTCTTTGCCTTGGGGAAGATCCTCTCTGTGGAGCTTGGG 

AAACAGAAGAAAAATTATATGCCGTACAACCACCAGCACAAATACTTCTT 

CCTAATTGGGCCCCCGGCCTTGCTGCCTCTCTACTTCCAGTGGTATATTT 

TCTATTTTGTTATCCAGCGAAAGAAGTGGGTGGACTTGGCCTGGATGATT 
ACCTTCTACGTCCGCTTCTTCCTCACTTATGTGCCACTATTGGGGCTGAA

AGCCTTCCTGGGCCTTTTCTTCATAGTCAGGTTCCTGGAAAGCAACTGGT 

TTGTGTGGGTGACACAGATGAACCATATTCCCATGCACATTGATCATGAC 

CGGAACATGGACTGGGTTTCCACCCAGCTCCAGGCCACATGCAATGTCCA 

CAAGTCTGCCTTCAATGACTGGTTCAGTGGACACCTCAACTTCCAGATTG 
AGCACCATCTTTTTCCCACGATGCCTCGACACAATTACCACAAAGTGGCT

CCCCTGGTGCAGTCCTTGTGTGCCAAGCATGGCATAGAGTACCAGTCCAA 

GCCCCTGCTGTCAGCCTTCGCCGACATCATCCACTCACTAAAGGAGTCAG 

GGCAGCTCTGGCTAGATGCCTATCTTCACCAATAA 

SEQ ID NO:68

>gi|14141179|ref|NM_013402.3| Homo sapiens fatty acid desaturase 1 (FADS1),
mRNA

TCCACTCCTGGAGCCCGCGGACCCCGAGCACGCGCCTGACAGCCCCTGCTGGCCCGGCGCGCGGCGTCGC 
CAGGCCAGCTATGGCCCCCGACCCGGTGGCCGCCGAGACCGCGGCTCAGGGACCTACCCCGCGCTACTTC

ACCTGGGACGAGGTGGCCCAGCGCTCAGGGTGCGAGGAGCGGTGGCTAGTGATCGACCGTAAGGTGTACA
ACATCAGCGAGTTCACCCGCCGGCATCCAGGGGGCTCCCGGGTCATCAGCCACTACGCCGGGCAGGATGC 
CACGGATCCCTTTGTGGCCTTCCACATCAACAAGGGCCTTGTGAAGAAGTATATGAACTCTCTCCTGATT 
GGAGAACTGTCTCCAGAGCAGCCCAGCTTTGAGCCCACCAAGAATAAAGAGCTGACAGATGAGTTCCGGG 
AGCTGCGGGCCACAGTGGAGCGGATGGGGCTCATGAAGGCCAACCATGTCTTCTTCCTGCTGTACCTGCT 

GCACATCTTGCTGCTGGATGGTGCAGCCTGGCTCACCCTTTGGGTCTTTGGGACGTCCTTTTTGCCCTTC
CTCCTCTGTGCGGTGCTGCTCAGTGCAGTTCAGGCCCAGGCTGGCTGGCTGCAGCATGACTTTGGGCACC 
TGTCGGTCTTCAGCACCTCAAAGTGGAACCATCTGCTACATCATTTTGTGATTGGCCACCTGAAGGGGGC 
CCCCGCCAGTTGGTGGAACCACATGCACTTCCAGCACCATGCCAAGCCCAACTGCTTCCGCAAAGACCCA 
GACATCAACATGCATCCCTTCTTCTTTGCCTTGGGGAAGATCCTCTCTGTGGAGCTTGGGAAACAGAAGA 

AAAAATATATGCCGTACAACCACCAGCACAAATACTTCTTCCTAATTGGGCCCCCAGCCTTGCTGCCTCT
CTACTTCCAGTGGTATATTTTCTATTTTGTTATCCAGCGAAAGAAGTGGGTGGACTTGGCCTGGATGATT
ACCTTCTACGTCCGCTTCTTCCTCACTTATGTGCCACTATTGGGGCTGAAAGCCTTCCTGGGCCTTTTCT 
TCATAGTCAGGTTCCTGGAAAGCAACTGGTTTGTGTGGGTGACACAGATGAACCATATTCCCATGCACAT 
TGATCATGACCGGAACATGGACTGGGTTTCCACCCAGCTCCAGGCCACATGCAATGTCCACAAGTCTGCC 
TTCAATGACTGGTTCAGTGGACACCTCAACTTCCAGATTGAGCACCATCTTTTTCCCACGATGCCTCGAC
ACAATTACCACAAAGTGGCTCCCCTGGTGCAGTCCTTGTGTGCCAAGCATGGCATAGAGTACCAGTCCAA 
GCCCCTGCTGTCAGCCTTCGCCGACATCATCCACTCACTAAAGGAGTCAGGGCAGCTCTGGCTAGATGCC 
TATCTTCACCAATAACAACAGCCACCCTGCCCAGTCTGGAAGAAGAGGAGGAAGACTCTGGAGCCAAGGC 
AGAGGGGAGCTTGAGGGACAATGCCACTATAGTTTAATACTCAGAGGGGGTTGGGTTTGGGGACATAAAG 
10 CCTCTGACTC?VAACTCCTCCCTTTTATCTTCTAGCCACAGTTCTAAGACCCAAAGTGGGGGGTGGACACA 
GAAGTCCCTAGGAGGGAAGGAGCTGTTGGGGCAGGGGTGTAAATTATTTCCTTTTTCTAGTTTGGCACAT 
GCAGGTAGTTGGTGAACAGAGAGAACCAGGAGGGTAACAGAAGAGGAGGGACCTACTGAACCCAGAGTCA 
GGAAGAGATTTAACACTAAAATTCCACTCATGCCGGGCGTGGTGGCACGCGCCTGTAATCCCAGCTACCC 
AGGAGGCTGAGGCAGGAGAATCGCTTGAACCGGGGAGGTGGAGGTTGCAGTGAGCTGAGATCACGCCATT 
1 5 GTACTCCAGCCTGGGCGACAGAGCAAGACTCGATTTCAAAAAAAAAAAAAAAATCCACTCATATAAAAGG 
TGAGCTCAGCTCACTGGTCCATTTCTCAGTGGCTTCTCCATCCTCATTTGCAAACCTCAGAGGGATAAGG 
CAGTTGAACCTGATGAGCAAGAATTATAACAGCAAGGAAACATTAATGCTTAGAATTCTGAGATCCAGCA 
CAACTCAGTCTGTGGGAGCTCAGCTCGCTGCCCAGGGATAGGTATGACCTATGTCTGCCTTAGGCTGCTG 
GGAGATGCCATTCTCCAGTTTCAGAAGCAGGCAGGGCAAAGGTCAAGACTGTGGTATTGGGGTCTTTTGG 
CTCTGAAGGATCCTGGAACCACTGATTTTGGTTTATTCCCTCCAGGGTCTAAAGAGAACAAGAGGTGCTA
GCTCTTACCAAAACAGATGGTAGAGAGAGTTGCTGGCTATTTAAAAAGCTCTTTCATCTTTTAATTCACC 
TCTTCTTTTCACCTCTTTAACCACTCCTCAGGAACAGAACACTTCTAGGACTGGGGGTCTTTTAGCTCCA 
TAAGCAAGTGAGCAGATGGGACAAGTTAGTCTTTTCTCCCTAGAAACAAAGGGGATGCCCAGTGGTTTCC 
GTTTGCTTCCCAACCTAAAATTTCAAGTTTAATAAAATAGCAATTAGCAGAAGTGACCAAATTGGGAGAT 
AATTATCAGTCATGAGGAAAGACACAGATTTCGGTCATAAAGAATGTAAGGGCTATAAGTAGAAACTTTC
TATAACCTAAATGATGTTATAGAATTATTTTTGAGCAGGAGCAGAAAGATTAAATATGATCACTTCATAC 
TTCTAAATCAGAAATAGGAAGATTAAAACCACAGAACAGTTTGTGATTTCTATTGCTGTAGCTAGGTATC 
TTACTCTGTCCACTCTTGTTCAAGTATCTAACTCTTCTGGAAACCAAATAGGCTTTAGAAGAGATTATCC 
TATATTCCTATCAGTATAATACTAAAATGTAACTTTTTAATCATCTGGTTTTTAAAAGATAAACAGTTTA 
GCCCATCTCTCCAGAGAGCAAACATAGGAATATGACTCAGGAGCCTCCTAGGGCTTATCATCAGCCCTCA
CACCCGCTTCCCCCTCCAACCCACAGCCTTTGCTTCCAGGTGGCAGGATTACTACTTTGCCTCTTCAGCA 
GCATCTACTCTAGGCATATTGATCATTTTAGACACTGGGAGAAGAGAACCTCAAACTAGGAGGAAAAGAC 
AGAGCCTCCACTTAGTTTTGGGAGGGGATGGCAGACAGTCAAGGAGATGAGCGTCCTAAGGCATGTTGGG 
ATAGGGTCAGATGCACCACCCATGGAGAGGTTTGTCAACACAAAGACATGGAAGGTTAGAGGTTTGTCAA 
CAAAAAGACATGGAAGGTTAGGTTTGTCAACACAAAGACATGGAAGATTAGAGGTTTGTCAACACAAAGA
CACAGGAAGAATGGGCTGCAGAAGATTTAGATGTTTTCCATTTGGGCACATTTTACTTAGCTGGAGAACT 
AGGTTTAAAACAGCCTGGGTAGGAAAATTAGAAGCAAGCTGGATGCAGTGGCTCATGCCTGTAATCCCAA 
CACTTTTGGGAGGTCCAGGCAGGAGGATCACTTGGGCCCAGGAGGTCAAGCCTGCAGCGAGCTGAGATCA 
CACCACTGCACTCCAGCCTGGGGTGATAGAACAAGACCCTGTCTCAAAAAAAAAAAAAAACAACAAAAAC 
TTAGAATTGAGGAGTTGTACCTCCATTGGCTTCCTCACTCCAAAATAGGTGCTGATCCTTCCTATTCCTA
TTCTTTGCCACCTTTTGGGTGTGGTGTCACCAGCCTGTTTAGCCAAGTAGCTTTGGGCATAGGCTGCCCA 
ATCTGAGCAAACACCAGTGAGGCTCTATTGAGCCAAGACCAAGTCCTCAAAGCACCTGAACCACTGTGGC 
CTTCTCAGCCTACAGCAGTGTGGTCTCTTACATGGCCACAAAGGGACACACAGTGACAAAAGGCTCGGAA 
TGTTACAATGGTAAAATGAGTGATCTCAAATCCACTGACAGATATAAAATAGGCTTAGAGAGGAAAAGCT 
GCCTCTGGTCAAGTAGATCATGGCAGCATGAATTCCAACTCACTTTTTTACAACTCCAACTTCTATGTTT
ATCTTTGTTACTTTCACTTTTTTACAACCTGGCCAGAGGCATTTTTTAAATCAGGCCCAATATCAGTATT 
CTTTTTGTGTGTGCCAATTTTGTTATCACATCCCTATGAAGTTGAAAAATAAAGTTAATTTTGACCAAAA 
AAAAAAAAAAAAG 

SEQ ID NO:69

>gi|11181775|ref|NP_037534.2| fatty acid desaturase 1; delta-5 fatty acid
desaturase; linoleoyl-CoA desaturase (delta-6-desaturase)-like 1; delta-5
desaturase [Homo sapiens]

MAPDPVAAETAAQGPTPRYFTWDEVAQRSGCEERWLVIDRKVYNISEFTKJ^PGGSRVISHYAGQDATDP 
55 FVAFHINKGLVKKYMNSLLIGEI^^ 

IiLDGAAWLTLWVFGTSFLPFLLCAVLLSAVQAQAGWLQHDFG 
WnSTHMHFQHHAKPNCFRKDPDINMHPFFFi^ 
WYIFYFVIQRKKWVDLAWMITFYVRFFLTYVPL^ 
RNMDWSTQLQATCNVHKSAFITO^ 
SAFADIIHSLKESGQLWLDAYLHQ

SEQ ID NO:70
Dynactin 6, p27 subunit 

>gi|18426895|ref|NM_006571.2| Homo sapiens novel RGD-containing protein
(WS-3), mRNA

CAACCCTGCCAGGCTCTCCAATCGCATGTGGAATTATCGCTCTACCCAGGCGGTGGTGTCGATCTACGTT
CCAATTGGGGCCGTACCATGGCGGAGAAGACTCAAAAGAGTGTGAAGATTGCTCCTGGAGCAGTTGTATG 
TGTAGAAAGTGAAATCAGAGGAGATGTAACTATCGGACCTCGGACAGTGATCCACCCTAAAGCAAGAATT 
ATTGCGGAAGCCGGGCCAATAGTGATTGGCGAAGGGAACCTAATAGAAGAACAGGCCCTTATCATAAATG 
CTTACCCAGATAATATCACTCCTGACACTGAAGATCCAGAACCAAAACCTATGATCATTGGCACCAATAA 

TGTGTTTGAAGTTGGCTGTTATTCCCAAGCCATGAAGATGGGAGATAATAATGTCATTGAATCAAAAGCA
TATGTAGGCAGAAATGTAATATTGACAAGTGGCTGCATCATTGGGGCTTGTTGCAACCTAAATACATTTG 
AAGTCATCCCTGAGAATACGGTGATCTATGGTGCAGACTGCCTTCGTCGGGTGCAGACTGAGCGACCGCA 
GCCCCAGACACTACAGCTGGATTTCTTGATGAAAATCTTGCCAAATTACC^CCACCTAAAGAAGACTATG 
AAAGGAAGCTCAACTCCAGTAAAGAACTAAGAACAGTGTATAACATGAAGATAACATTTTGTCTTTGACC 

ACTGTCTTTTGAATGGGCCCACAGTGTTTATGTACTC

ATTTTGTAAAATTGGGTTGAGAGGAAACTAATGGAGTTTCATTGTAACTGTCCTTTGTAATTTATATAAA 
TGTATTATTTTCCTATATCCTTGGTTCTTTTCTGATAATTTACAGATTTAGCTTTTCTTTTGTTATATAA 
ACTGCTAGCCACAAATTTTAGTTATGTAAAAGGCTACCCTTGACAAGAAAAGACATACTGTCATGTATTT 
ATATTCTAGCATAGACTAAACTGAATAAAAATGCTGATAACAGGACCTTTAAAAAAAAAAAAAA 

SEQ ID NO:71

>gi|5730116|ref|NP_006562.1| novel RGD-containing protein [Homo sapiens]
MAEKTQKSVKIAPGAVVCVESEIRGDVTIGPRTVIHPKARIIAEAGPIVIGEGNLIEEQALIINAYPDNI
TPDTEDPEPKPMI IGTN]^FEVGCYSQAMKMGDNNVIESKA PEN 
TVIYGADCLRRVQTERPQPQTLQLDFLMKILPNYHHLKKTMKGSSTPVKN



SEQ ID NO:72 

Elongation factor 1 alpha (EF1a)
>GL1_34_LOW_3_G3F1 Direction: sense
ACTGTNCCTGCTTGGCCGAGTGGAGACTGGTGTTCTCAAACCCGGATATGGTGGTCACCTTTGCTCCAGCTCAAC
GTTACAACGGAAGTAAAATCTGTCGAAATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGC 
TTCAATGTCAAGAATGTGTCTGTCAAGGATGTTCGTCGTGGCAACGTNGCTGGTGACAGCAAAAATGACCCACCA 
ATGGAAGCAGCTGGCTTCACTGCTCAGGTGATTATCCTGAACCATCCAGGCCAAATAAGCGCC 



SEQ ID NO:73

>GL1_34_LOW_3_G3R1 Direction: anti-sense

GGCGCTTATTTGGCCTGGATGGTTCAGGATAATCACCTGAGCAGTGAAGCCAGCTGCTTCCATTGGTGGGTCATT 
TTTGCTGTCACCAGCAACGTTGCCACGACGAACATCCTTGACAGACACATTCTTGACATTGAAGCCCACATTGTC 
CCCAGGAAGAGCTTCACTCAAAGCTTCATGGTGCATTTCGACAGATTTTACTTCCGTTGTAACGTTGACTGGAGC 
AAAGGTGACCACCATACCGGGTTTGAGAACACCAGTCTCCACTCGGCCAACAGGAACAGT



SEQ ID NO:74 

>GL1_34_LOW_4_G3F1 Direction: sense

ACTGTTCCTGTTGGCCGAGTGGAGACTGGTGTTCTCAAACCCGGTATGGTGGTCACCTTTGCTCCAGTCAACGTT 
ACAACGGAAGTAAAATCTGTCGAAATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTC
AATGTCAAGAATGTGTCTGTCAAGGATGTTCGTCGTGGCAACGTTGCTGGTGACAGCAAAAATGACCCACCAATG 
GAAGCAGCTGGCTTCACTGCTCAGGTGATTATCCTGAACCATCCAGGCCAAATAAGCGCC 



SEQ ID NO:75 
>GL1_34_LOW_4_G3R1 Direction: anti-sense

GGCGCTTATTTGGNCTGGATGGTTCAGGATAATCACCTGAGCAGTGAAGCCAGCTGCTTCCATTGGTGGGTCATT 
TNTGNTGTCACCAGCAACGTTGCCACGACGAACATCCTTGACAGACACATTCTTGACATTGAAGCCCACATTGTC 
CCCAGGAAGAGCTTCACTCAAAGCTTCATGGTGCATTTCGACAGATTTTACTTCCGTTGTAACGTTGACTGG 
AAAGGTGACCACCATACCGGGTTTGAGAACACCAGTCTCCACTCGGCCAACAGGAACAGT 

SEQ ID NO:76 

>gi|20127674|ref|NM_001402.3| Homo sapiens eukaryotic translation
elongation factor 1 alpha 1 (EEF1A1), mRNA

GGCACGAGGGTTTGCCGCCAGAACACAGGTGTCGTGAAAACTACCCCTAAAAGCCAAAATGGGAAAGGAA
AAGACTCATATCAACATTGTCGTCATTGGACACGTAGATTCGGGCAAGTCCACCACTACTGGCCATCTGA
TCTATAAATGCGGTGGC^^RvCAAAAGAACCATTGAAAAATTTGAGAAGGAG^^CTGAGATGGGAAA
GGGCTCCTTCAAGTATGCCTGGGTCTTGGATAAACTGAAAGCTGAGCGTGAACGTGGTATCACCATTGAT 
ATCTCCTTGTGGAAATTTGAGACCAGCAAGTACTATGTGACTATCATTGATGCCCCAGGACACAGAGACT 
TTATCAAAAACATGATTACAGGGACATCTCAGGCTGACTGTGCTGTCCTGATTGTTGCTGCTGGTGTTGG 
TGAATTTGAAGCTGGTATCTCCAAGAATGGGCAGACCCGAGAGCATGCCCTTCTGGCTTACACACTGGGT
GTGAAACAACTAATTGTCGGTGTTAACAAAATGGATTCCACTGAGCCACCCTACAGCCAGAAGAGATATG 
AGGAAATTGTTAAGGAAGTCAGCACTTACATTAAGAAAATTGGCTACAACCCCGACACAGTAGCATTTGT 
GCCAATTTCTGGTTGGAATGGTGACAACATGCTGGAGCCAAGTGCTAACATGCCTTGGTTCAAGGGATGG 
AAAGTCACCCGTAAGGATGGCAATGCCAGTGGAACCACGCTGCTTGAGGCTCTGGACTGCATCCTACCAC 
CAACTCGTCCAACTGACAAGCCCTTGCGCCTGCCTCTCCAGGATGTCTACAAAATTGGTGGTATTGGTAC
TGTTCCTGTTGGCCGAGTGGAGACTGGTGTTCTCAAACCCGGTATGGTGGTCACCTTTGCTCCAGTCAAC 
GTTACAACGGAAGTAAAATCTGTCGAAATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATG 
TGGGCTTCAATGTCAAGAATGTGTCTGTCAAGGATGTTCGTCGTGGCAACGTTGCTGGTGACAGCAAAAA 
TGACCCACCAATGGAAGCAGCTGGCTTCACTGCTCAGGTGATTATCCTGAACCATCCAGGCCAAATAAGC 

GCCGGCTATGCCCCTGTATTGGATTGCCACACGGCTCACATTGCATGCAAGTTTGCTGAGCTGAAGGAAA
AGATTGATCGCCGTTCTGGTAAAAAGCTGGAAGATGGCCCTAAATTCTTGAAGTCTGGTGATGCTGCCAT 
TGTTGATATGGTTCCTGGCAAGCCCATGTGTGTTGAGAGCTTCTCAGACTATCCACCTTTGGGTCGCTTT 
GCTGTTCGTGATATGAGACAGACAGTTGCGGTGGGTGTCATCAAAGCAGTGGACAAGAAGGCTGCTGGAG 
CTGGCAAGGTCACCAAGTCTGCCCAGAAAGCTCAGAAGGCTAAATGAATATTATCCCTAATACCTGCCAC 

CCCACTCTTAATCAGTGGTGGAAGAACGGTCTCAGAACTGTTTGTTTCAATTGGCCATTTAAGTTTAGTA
GTAAAAGACTGGTTAATGATAACAATGCATCGTAAAACCTTCAGAAGGAAAGGAGAATGTTTTGTGGACC 
ACTTTGGTTTTCTTTTTTGCGTGTGGCAGTTTTAAGTTATTAGTTTTTAAAATCAGTACTTTTTAATGGA 
AACAACTTGACCAAAAATTTGTC^CAGAATTTTGAGACCCAT^ 
^^^^^ 

AAAAAAAAAAAAA

SEQ ID NO:77 

>gi|4503471|ref|NP_001393.1| eukaryotic translation elongation factor 1
alpha 1 [Homo sapiens]
MGKEKTHINIVVIGHVDSGKSTTTGHLIYKCGG

ITIDISLWKFETSKYYVTIIDAPGHRDFIKNMITGTSQADCAVLIVAAGVGEFEAGISKNGQTREHALLA
YTLGVKQLIVGVNKMDSTEPPYSQKRYEEIVKEVSTYIKKIGYNPDTVAFVPISGWNGDNMLEPSANMPW
FKGWKVTRKDGNASGTTLLEALDCILPPTRPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVTF 
APVNVTTEVKSVEMHHEALSEALPGDNVGFNVKNVSVKDVRR

GQISAGYAPVLDCHTAHIACKFAELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP

LGRFAVRDMRQTVAVGVIKAVDKKAAGAGKVTKSAQKAQKAK

SEQ ID NO:78 

Elongation factor 4 gamma (EIF4γ)
>GL1_3_1_G3F1 Direction: sense

GTCCAACTCAAAGATATGGGAGAGGATTTGGAGTGCCTCTGTCAGATAATGAGGACAGTGGGACCTAGATTAGAC 
CATGAACGAGCCAAGTCCTTAATGGATCAGT 

SEQ ID NO:79 
>GL1_3_1_G3R1 Direction: anti-sense

ACTGATCCATTAAGGACTTGGCTCGTTCATGGTCTAATCTAGGTCCCACTGGTCCTCATTATCTGACAGAGGCAC 
TCCAAATCCTCTCCCATATCTTTGAGTTGGAC 

SEQ ID NO:80 

>GL1_3_2_G3R1 Direction: anti-sense

ACTGATCCATTAAGGACTTGGWTCGTTCATGGTCTAATCTANAGTCCCACTGGTCCTCATTATCTGACAGGAGAG 
CACTCCAAATCCTCTCCCATATCTTTGAGTTGGAC 

SEQ ID NO:81

>gi|4503538|ref|NM_001418.1| Homo sapiens eukaryotic translation initiation
factor 4 gamma, 2 (EIF4G2), mRNA

CAGCAGTGAGTCGGAGCTCTATGGAGGTGGCAGCGGGTACCGAGTGGCGGCTGCAGCAGCGACTCCTCTG 
AGCTGAGTTTGAGGCCGTCCCCGACTCCTTCCTCCCCCTTCCCTCCCCCTTTTTTTTGTTTTCCGTTCCC 
CTTTCCCCTCCCTTCCCTATCCCCGACGACCGGATCCTGAGGAGGCAGCTGCGGTGGCAGCTGCTGAGTT 
CTCGGTGAAGGTATTTCATTTCTCCTGTCCCCTCCCCTCCCCACCCCATCTATTAATATTATTCTTTTGA
TTCTC



AGATTCTTCGTTGTCAAGCCGCCAAAGTGGAGAGTGCGATTGCAGAAGGGGGTGCTTCTCGTTTCAGTGC
TTCTTCGGGCGGAGGAGGAAGTAGGGGTGCACCTCAGCACTATCCCAAGACTGCTGGCAACAGCGAGTTC 
CTGGGGAAAACCCCAGGGCAAAACGCTCAGAAATGGATTCCTGCACGAAGCACTAGACGAGATGACAACT 
CCGCAGCAAACAACTCCGCAAACGAAAAAGAACGACATGATGCAATCTTCAGGAAAGTAAGAGGCATACT 
AAATAAGCTTACTCCTGAAAAGTTTGACAAGCTATGCCTTGAGCTCCTCAATGTGGGTGTAGAGTCTAAA
CTCATCCTTAAAGGGGTCATACTGCTGATTGTGGACAAAGCCCTAGAAGAGCCAAAGTATAGCTCACTGT 
ATGCTCAGCTATGTCTGCGATTGGCAGAAGATGCACCAAACTTTGATGGCCCAGCAGCAGAGGGTCAACC 
AGGACUVGAAGCAAAGCACCACATTCAGACGCCTCCTAATTTCCAAATTACAAGATGAATTTGAAAACCGA 
ACTAGAAATGTTGATGTCTATGATAAGCGTGAAAATCCCCTCCTCCCCGAGGAGGAGGAACAGAGAGCCA 

TTGCTAAGATCAAGATGTTGGGAAACATCAAATTCATTGGAGAGCTTGGCAAGCTTGATCTTATTCACGA
ATCTATCCTTCATAAGTGCATCAAAACACTTTTGGAAAAGAAGAAGAGAGTCCAACTCAAAGATATGGGA 
GAGGATTTGGAGTGCCTCTGTCAGATAATGAGGACAGTGGGACCTAGATTAGACCATGAACGAGCCAAGT 
CCTTAATGGATCAGTACTTTGCCCGAATGTGCTCCTTGATGTTAAGTAAGGAATTGCCAGCAAGGATTCG 
TTTCCTGCTGCAGGATACCGTAGAGTTGCGAGAACACCATTGGGTTCCTCGCAAGGCTTTTCTTGACAAT 

GGACCAAAGACGATCAATCAAATTCGTCAAGATGCAGTAAAAGATCTAGGGGTGTTTATTCCTGCTCCTA
TGGCTGAAGGGATGAGAAGTGACTTCTTTCTGGAGGGACCGTTCATGCCACCCAGGATGAAAATGGATAG 
GGACCCACTTGGAGGACTTGCTGATATGTTTGGACAAATGCCAGGTAGCGGAATTGGTACTGGTCCAGGA 
GTTATCCAGGATAGATTTTCACCCACCATGGGACGTCATCGTTCAAATCAACTCTTCAATGGCCATGGGG 
GACAC^TCATGCCTCCCAC^CAATCGCAGTTTGGAGAGATGGGAGGCAAGTTTATGAAAAGCC^GG 

AAGCCAGCTCTACCATAACCAGAGTCAGGGACTCTTATCCCAGCTGCAAGGACAGTCGAAGGATATGCCA 
CCTCGGTTTTCTAAGAAAGGACAGCTTAATGCAGATGAGATTAGCCTGAGGCCTGCTCAGTCGTTCCTAA 
TGAATAAAAATCAAGTGCCAAAGCTTCAGCCCCAGATAACTATGATTCCTCCTAGTGCACAACCACCACG 
CACTCAAACACCACCTCTGGGACAGACACCTCAGCTTGGTCTCAAAACTAATCCACCACTTATCCAGGAA 
AAGCCTGCCAAGACCAGCAAAAAGCCACCACCGTCAAAGGAAGAACTCCTTAAACTAACTGAAACTGTT^ 

TGACTGAATATCTAAATAGTGGAAATGCAAATGAGGCTGTCAATGGTGTAAGAGAAATGAGGGCTCCTAA
AC^CTTTCTTCCTGAGATGTTAAGCAAAGTAATC^TCCTGTCACTAGATAGAAGCGATGAAGATAAAG^ 
AAAGCAAGTTCTTTGATCAGTTTACTCAAACAGGAAGGGATAGCCACAAGTGACAACTTCATGCAGGCTT 
TCCTGAATGTATTGGACCAGTGTCCCAAACTGGAGGTTGACATCCCTTTGGTGAAATCCTATTTAGCACA 
GTTTGCAGCTCGTGCCATCATTTCAGAGCTGGTGAGCATTTCAGAACTAGCTCAACCACTAGAAAGTGGC 

ACCCATTTTCCTCTCTTCCTACTTTGTCTTCAGCAGTTAGCTAAATTACAAGATCGAGAATGGTTAACAG
AACTTTTTCAACAAAGCAAGGTCAATATGCAGAAAATGCTCCCAGAAATTGATCAGAATAAGGACCGCAT 
GTTGGAGATTTTGGAAGGAAAGGGACTGAGTTTCTTATTCCCACTCCTCAAATTGGAGAAGGAACTGTTG 
AAGGAAATAAAGTTGGATCCATCCCCTCAAACCATATATAAATGGATTAAAGATAACATCTCTCCCAAAC 
TTCATGTAGATAAAGGATTTGTGAACATCTTAATGACTAGCTTCTTACAGTACATTTCTAGTGAAGTAAA 

CCCCCCCAGCGATGAAACAGATTCATCCTCTGCTCCTTCCAAAGAACAGTTAGAGCAGGAAAAACAACTA
CTACTATCTTTCAAGCCAGTAATGCAGAAATTTCTTCATGATCACGTTGATCTACAAGTCAGTGCCCTGT 
ATGCTCTCCAGGTGCACTGCTATAACAGCAACTTCCCAAAAGGCATGTTACTTCGCTTTTTTGTGCACTT 
CTATGACATGGAAATTATTGAAGAAGAAGCTTTCTTGGCTTGGAAAGAAGATATAACCCAAGAGTTTCCG 
GGAAAAGGCAAGGCTTTGTTCCAGGTGAATCAGTGGCTAACCTGGTTAGAAACTGCTGAAGAAGAAGAAT 


CAGAGGAAGAAGCTGACTAAAGAACCAGCCAAAGCCTTAAATTGTGCAAAACATACTGTTGCTATGATGT
AACTGCATTTGACCTAACCACTGCGAAAATTCATTCCGCTGTAATGTTTTCACAATATTTAAAGCAGAAG 
CACGTCAGTTAGGATTTCCTTCTGCATAAGGTTTTTTTGTAGTGTAATGTCTTAATCATAGTCTACCATC 
AAATATTTTAGGAGTATCTTTAATGTTTAGATAGTATATTAGCAGCATGCAATAATTACATCATAAGTTC 
TCAAGCAGAGGCAGTCTATTGCAAGGACCTTCTTTGCTGCCAGTTATCATAGGCTGTTTTAAGTTAGAAA 

ACTGAATAGCAACACTGAATACTGTAGAAATGCACTTTGCTCAGTAATACTTGAGTTGTTGCAATATTTG
ATTATCCATTTGGTTGTTACAGAAAAATTCTTAACTGTAATTGATGGTTGTTGCCGTAATAGTATATTGC 
CTGTATTTCTACCTCTAGTAATGGGCTTTATGTGCTAGATTTTAATATCCTTGAGCCTGGGCAAGTGCAC 
AAGTCTTTTTAAAAGAAACATGGTTTACTTGCACAAAACTGATCAGTTTTGAGAGATCGTTAATGCCCTT 
GAAGTGGTTTTTGTGGGTGTGAAAC^UUVTGGTGAGAATTTGAATTGGTCCCTCCTATTATAGTATTGAAA 

TTAAGTCTACTTAATTTATCAAGTCATGTTCATGCCCTGATTTTATATACTTGTATCTATCAATAAACAT
TGTGATACTTGAAAAAAAAAAAAAAAA?IAAAAAAAAAAAA 

SEQ ID NO:82 

>gi|4503539|ref|NP_001409.1| eukaryotic translation initiation factor 4
gamma, 2; Eukaryotic translation initiation factor 4G-like 1; p97 [Homo
sapiens]

MESAIAEGGASRFSASSGGGGSRGAPQHYPKTAGNSEFLGKTPGQNAQKWIPARSTRRDDNSAAimSA^ 
KERHDAIFRKVRGILNKLTPEKFD 

EDAPNFDGPAAEGQPGQKQSTTFRRLLISKLQDEFENRTRNVDVYDKRENPLLPEEEEQRAIAKIKMLGN
IKFIGELGKLDLIHESILHKCIKTLLEKKKRVQ

MCSLMLSKELPARIRFLLQDTVELREHHWVPRKAFLDNGPKTINQIRQDAVKDLGVFIPAPM^ 
FLEGPFMPPRMKMDRDPL^^MFGQMPGSGIGTGPGVIQDRFSP™

QFGEMGGKPMKSQGLSQLYHNQSQGLLSQLQGQSKDMPPRFSKKGQIjNiUDEISLRPAQSFLMNKNQVP^ 
QPQITMIPPSAQPPRTQTPPLGQTPQLGLKTNPPLIQEKPAKTSKKPPPSKEELLKXjTETVVTEYLNSG 
ANEAWGVREMRAPKHFLPEMLSKVIILSLDRSDEDK^ 
KLEVDIPLVKSYLAQFAARAIISELVSISELAQ

MQKMLPEIDQNKDRMLEILEGKGLSFLFPLLKLEKEL^^ 

ILMTSFLQYISSEWPPSDETDSSSAPSKEQLEQEKQIjLLSFKPVMQKPLHDHVDLQVSAIiYALQVHC™ 
SNFPKGMLLRFFVHFYDMEIIEEEAFLAWKEDITQEFPGKGKALFQVNQWLTWLETAEEEESEEEAD

SEQ ID NO:83

Filamin A 

>GL3_11C_PCR_G3F1 Direction: sense 

ACATCCAGGACACGGTGATGGCACGCACACCATTACCTACATTCCCCTCTGCCCCGGGGCCTACACCGTCACCAT 
CAAGTACGGCA 

SEQ ID NO:84 

>GL3_15A_2_M13R Direction: anti-sense

TGCCGTACTTGATGGTGACGGTGTAGGCCCCGGGGCAGAGGGGAATGTAGGTAATGGTGTGCGTGCCATCACCGT 
GGTCCTGGATGTTCGCAGAAACCAGCACACTGGAGCCACCGCTGCACCGGGCTTGGGGGCTCCTCCTTGTACAGC 
TCGTCCATGCCGAGAGAAGGGCGAATTCTTGCAGATATNCATCACACTGGCGGGCGCTCGAGC

SEQ ID NO:85 

>GL3_32C_PCR_G3F1 Direction: sense 

ACATCAGGACCACGGATGATGGCACGCACACCATACCTACATCCCCTCTGCCCCGGGGCCTACACCGTCACCATC 
AAGTACGGCACCGNAGAACCCAGCACAGTGGTNAGATAGATAAAGCGGCCGCTCGACTAGTCTGAGGTCTGAATA
CTCAACTGAACTGTACGTAAAAACAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAAA 

SEQ ID NO:86

>gi|4503744|ref|NM_001456.1| Homo sapiens filamin A, alpha (actin binding
protein 280) (FLNA), mRNA

GCGATCCGGGCGCCACCCCGCGGTCATCGGTCACCGGTCGCTCTCAGGAACAGCAGCGCAACCTCTGCTC 
CCTGCCTCGCCTCCCGCGCGCCTAGGTGCCTGCGACTTTAATTAAAGGGCCGTCCCCTCGCCGAGGCTGC 

AGCACCGCCCCCCCGGCTTCTCGCGCCTCAAAATGAGTAGCTCCCACTCTCGGGCGGGCCAGAGCGCAGC
AGGCGCGGCTCCGGGCGGCGGCGTCGACACGCGGGACGCCGAGATGCCGGCCACCGAGAAGGACCTGGCG 
GAGGACGCGCCGTGGAAGAAGATCCAGCAGAACACTTTCACGCGCTGGTGCAACGAGCACCTGAAGTGCG 
TGAGCAAGCGCATCGCCAACCTGCAGACGGACCTGAGCGACGGGCTGCGGCTTATCGCGCTGTTGGAGGT 
GCTCAGCCAGAAGAAGATGCACCGCAAGCACAACCAGCGGCCCACTTTCCGCCAAATGCAGCTTGAGAAC 

GTGTCGGTGGCGCTCGAGTTCCTGGACCGCGAGAGCATCAAACTGGTGTCCATCGACAGCAAGGCCATCG
TGGACGGGAACCTGAAGCTGATCCTGGGCCTCATCTGGACCCTGATCCTGCACTACTCCATCTCCATGCC 
CATGTGGGACGAGGAGGAGGATGAGGAGGCCAAGAAGCAGACCCCCAAGCAGAGGCTCCTGGGCTGGATC 
CAGAACAAGCTGCCGCAGCTGCCCATCACCAACTTCAGCCGGGACTGGCAGAGCGGCCGGGCCCTGGGCG 
CCCTGGTGGACAGCTGTGCCCCGGGCCTGTGTCCTGACTGGGACTCTTGGGACGCCAGCAAGCCCGTTAC 

CAATGCGCGAGAGGCCATGCAGCAGGCGGATGACTGGCTGGGCATCCCCCAGGTGATCACCCCCGAGGAG
ATTGTGGACCCCAACGTGGACGAGCACTCTGTCATGACCTACCTGTCCCAGTTCCCCAAGGCCAAGCTGA 
AGCCAGGGGCTCCCTTGCGCCCCAAACTGAACCCGAAGAAAGCCCGTGCCTACGGGCCAGGCATCGAGCC 
CACAGGCAACATGGTGAAGAAGCGGGCAGAGTTCACTGTGGAGACCAGAAGTGCTGGCCAGGGAGAGGTG 
CTGGTGTACGTGGAGGACCCGGCCGGACACCAGGAGGAGGCAAAAGTGACCGCCAATAACGACAAGAACC 

GCACCTTCTCCGTCTGGTACGTCCCCGAGGTGACGGGGACTCATAAGGTTACTGTGCTCTTTGCTGGCCA
GCACATCGCCAAGAGCCCCTTCGAGGTGTACGTGGATAAGTCACAGGGTGACGCCAGCAAAGTGACAGCC 
CAAGGTCCCGGCCTGGAGCCCAGTGGCAACATCGCCAACAAGACCACCTACTTTGAGATCTTTACGGCAG 
GAGCTGGCACGGGCGAGGTCGAGGTTGTGATCCAGGACCCCATGGGACAGAAGGGCACGGTAGAGCCTCA 
GCTGGAGGCCCGGGGCGACAGCACATACCGCTGCAGCTACCAGCCCACCATGGAGGGCGTCCACACCGTG 

CACGTCACGTTTGCCGGCGTGCCCATCCCTCGCAGCCCCTACACTGTCACTGTTGGCCAAGCCTGTAACC
CGAGTGCCTGCCGGGCGGTTGGCCGGGGCCTCCAGCCCAAGGGTGTGCGGGTGAAGGAGACAGCTGACTT 
CAAGGTGTACACAAAGGGCGCTGGCAGTGGGGAGCTGAAGGTCACCGTGAAGGGCCCCAAGGGAGAGGAG 
CGCGTGAAGCAGAAGGACCTGGGGGATGGCGTGTATGGCTTCGAGTATTACCCCATGGTCCCTGGAACCT 
ATATCGTCACCATCACGTGGGGTGGTCAGAACATCGGGCGCAGTCCCTTCGAAGTGAAGGTGGGCACCGA 

GTGTGGCAATCAGAAGGTACGGGCCTGGGGCCCTGGGCTGGAGGGCGGCGTCGTTGGCAAGTCAGCAGAC
TTTGTGGTGGAGGCTATCGGGGACGACGTGGGCACGCTGGGCTTCTCGGTGGAAGd^CATCGCAGGCTA
AGATCGAATGTGACGACAAGGGCGACGGCTCCTGTGATGTGCGCTACTGGCCGCAGGAGGCTGGCGAGTA 
TGCCGTTCACGTGCTGTGCAACAGCGAAGACATCCGCCTCAGCCCCTTCATGGCTGACATCCGTGACGCG 
CCCCAGGACTTCCACCCAGACAGGGTGAAGGCACGTGGGCCTGGATTGGAGAAGACAGGTGTGGCCGTCA 
ACAAGCCAGCAGAGTTCACAGTGGATGCCAAGCACGGTGGCAAGGCCCCACTTCGGGTCCAAGTCCAGGA
CAATGAAGGCTGCCCTGTGGAGGCGTTGGTCAAGGACAACGGCAATGGCACTTACAGCTGCTCCTACGTG 
CCCAGGAAGCCGGTGAAGCACACAGCCATGGTGTCCTGGGGAGGCGTCAGCATCCCCAACAGCCCCTTCA 
GGGTGAATGTGGGAGCTGGCAGCCACCCCAACAAGGTCAAAGTATACGGCCCCGGAGTAGCCAAGACAGG 
GCTCAAGGCCCACGAGCCCACCTACTTCACTGTGGACTGCGCCGAGGCTGGCCAGGGGGACGTCAGCATC 
GGCATCAAGTGTGCCCCTGGAGTGGTAGGCCCCGCCGAAGCTGACATCGACTTCGACATCATCCGCAATG
ACAATGACACCTTCACGGTCAAGTACACGCCCCGGGGGGCTGGCAGCTACACCATTATGGTCCTCTTTGC 
TGACCAGGCCACGCCCACCAGCCCCATCCGAGTCAAGGTGGAGCCCTCTCATGACGCCAGTAAGGTGAAG 
GCCGAGGGCCCTGGCCTCAGTCGCACTGGTGTCGAGCTTGGCAAGCCCACCCACTTCACAGTAAATGCCA 
AAGCTGCTGGCAAAGGCAAGCTGGACGTCCAGTTCTCAGGACTCACCAAGGGGGATGCAGTGCGAGATGT 
GGACATCATCGACCACCATGACAACACCTACACAGTCAAGTACACGCCTGTCCAGCAGGGTCCAGTAGGC
GTCAATGTCACTTATGGAGGGGATCCCATCCCTAAGAGCCCTTTCTCAGTGGCAGTATCTCCAAGCCTGG 
ACCTCAGCAAGATCAAGGTGTCTGGCCTGGGAGAGAAGGTGGACGTTGGCAAAGACCAGGAGTTCACAGT 
CAAATCAAAGGGTGCTGGTGGTCAAGGCAAAGTGGCATCCAAGATTGTGGGCCCCTCGGGTGCAGCGGTG 
CCCTGCAAGGTGGAGCCAGGCCTGGGGGCTGACAACAGTGTGGTGCGCTTCCTGCCCCGTGAGGAAGGGC 
CCTATGAGGTGGAGGTGACCTATGACGGCGTGCCCGTGCCTGGCAGCCCCTTTCCTCTGGAAGCTGTGGC
CCCCACCAAGCCTAGCAAGGTGAAGGCGTTTGGGCCGGGGCTGCAGGGAGGCAGTGCGGGCTCCCCCGCC 
CGCTTCACCATCGACACCAAGGGCGCCGGCACAGGTGGCCTGGGCCTGACGGTGGAGGGCCCCTGTGAGG 
CGCAGCTCGAGTGCTTGGACAATGGGGATGGCACATGTTCCGTGTCCTACGTGCCCACCGAGCCCGGGGA 
CTACAACATCAACATCCTCTTCGCTGACACCCACATCCCTGGCTCCCCATTCAAGGCCCACGTGGTTCCC 
TGCTTTGACGCATCCAAAGTCAAGTGCTCAGGCCCCGGGCTGGAGCGGGCCACCGCTGGGGAGGTGGGCC
AATTCCAAGTGGACTGCTCGAGCGCGGGCAGCGCGGAGCTGACCATTGAGATCTGCTCGGAGGCGGGGCT 
TCCGGCCGAGGTGTACATCCAGGACCACGGTGATGGCACGCACACCATTACCTACATTCCCCTCTGCCCC 
GGGGCCTACACCGTCACCATCAAGTACGGCGGCCAGCCCGTGCCCAACTTCCCCAGCAAGCTGCAGGTGG 
AACCTGCGGTGGACACTTCCGGTGTCCAGTGCTATGGGCCTGGTATTGAGGGCCAGGGTGTCTTCCGTGA 
GGCCACCACTGAGTTCAGTGTGGACGCCCGGGCTCTGACACAGACCGGAGGGCCGCACGTCAAGGCCCGT
GTGGCCAACCCCTCAGGCAACCTGACGGAGACCTACGTTCAGGACCGTGGCGATGGCATGTACAAAGTGG 
AGTACACGCCTTACGAGGAGGGACTGCACTCCGTGGACGTGACCTATGACGGCAGTCCCGTGCCCAGCAG 
CCCCTTCCAGGTGCCCGTGACCGAGGGCTGCGACCCCTCCCGGGTGCGTGTCCACGGGCCAGGCATCCAA 
AGTGGCACCACCAACAAGCCCAACAAGTTCACTGTGGAGACCAGGGGAGCTGGCACGGGCGGCCTGGGCC 
TGGCTGTAGAGGGCCCCTCCGAGGCCAAGATGTCCTGCATGGATAACAAGGACGGCAGCTGCTCGGTCGA
GTACATCCCTTATGAGGCTGGCACCTACAGCCTCAACGTCACCTATGGTGGCCATCAAGTGCCAGGCAGT 
CCTTTCAAGGTCCCTGTGCATGATGTGACAGATGCGTCCAAGGTCAAGTGCTCTGGGCCCGGCCTGAGCC 
CAGGCATGGTTCGTGCCAACCTCCCTCAGTCCTTCCAGGTGGACACAAGCAAGGCTGGTGTGGCCCCATT 
GCAGGTCAAAGTGCAAGGGCCCAAAGGCCTGGTGGAGCCAGTGGACGTGGTAGACAACGCTGATGGCACC 
CAGACCGTCAATTATGTGCCCAGCCGAGAAGGGCCCTACAGCATCTCAGTACTGTATGGAGATGAAGAGG
TACCCCGGAGCCCCTTCAAGGTCAAGGTGCTGCCTACTCATGATGCCAGCAAGGTGAAGGCCAGTGGCCC 
CGGGCTCAACACCACTGGCGTGCCTGCCAGCCTGCCCGTGGAGTTCACCATCGATGCAAAGGACGCCGGG 
GAGGGCCTGCTGGCTGTCCAGATCACGGATCCCGAAGGCAAGCCGAAGAAGACACACATCCAAGACAACC 
ATGACGGCACGTATACAGTGGCCTACGTGCCAGACGTGACAGGTCGCTACACCATCCTCATCAAGTACGG 
TGGTGACGAGATCCCCTTCTCCCCGTACCGCGTGCGTGCCGTGCCCACCGGGGACGCCAGCAAGTGCACT
GTCACAGTGTCAATCGGAGGTCACGGGCTAGGTGCTGGCATCGGCCCCACCATTCAGATTGGGGAGGAGA 
CGGTGATCACTGTGGACACTAAGGCGGCAGGCAAAGGCAAAGTGACGTGCACCGTGTGCACGCCTGATGG 
CTCAGAGGTGGATGTGGACGTGGTGGAGAATGAGGACGGCACTTTCGACATCTTCTACACGGCCCCCCAG 
CCGGGCAAATACGTCATCTGTGTGCGCTTTGGTGGCGAGCACGTGCCCAACAGCCCCTTCCAAGTGACGG 
CTCTGGCTGGGGACCAGCCCTCGGTGCAGCCCCCTCTACGGTCTCAGCAGCTGGCCCCACAGTACACCTA
CGCCCAGGGCGGCCAGCAGACTTGGGCCCCGGAGAGGCCCCTGGTGGGTGTCAATGGGCTGGATGTGACC 
AGCCTGAGGCCCTTTGACCTTGTCATCCCCTTCACCATCAAGAAGGGCGAGATCACAGGGGAGGTTCGGA 
TGCCCTCAGGCAAGGTGGCGCAGCCCACCATCACTGACAACAAAGACGGCACCGTGACCGTGCGGTATGC 
ACCCAGCGAGGCTGGCCTGCACGAGATGGACATCCGCTATGACAACATGCACATCCCAGGAAGCCCCTTG 
CAGTTCTATGTGGATTACGTCAACTGTGGCCATGTCACTGCCTATGGGCCTGGCCTCACCCATGGAGTAG
TGAACAAGCCTGCCACCTTCACCGTCAACACCAAGGATGCAGGAGAGGGGGGCCTGTCTCTGGCCATTGA 
GGGCCCGTCCAAAGCAGAAATCAGCTGCACTGACAACCAGGATGGGACATGCAGCGTGTCCTACCTGCCT 
GTGCTGCCGGGGGACTACAGCATTCTAGTCAAGTACAATGAACAGCACGTCCCAGGCAGCCCCTTCACTG 
CTCGGGTCACAGGTGACGACTCCATGCGTATGTCCCACCTAAAGGTCGGCTCTGCTGCCGACATCCCCAT 
CAACATCTCAGAGACGGATCTCAGCCTGCTGACGGCCACTGTGGTCCCGCCCTCGGGCCGGGAGGAGCCC
TGTTTGCTGAAGCGGCTGCGTAATGGCCACGTGGGGATTTCATTCGTGCCCAAGGAGACGGGGGAGCACC 
TGGTGCATGTGAAGAAA^SiroCCAGCACGTGGCCAGCAGCCCCATCCCGGTGSRTCAGCCAGTCGGA

AATTGGGGATGCCAGTCGTGTTCGGGTCTCTGGTCAGGGCCTTCACGAAGGCCACACCTTTGAGCCTGCA 
GAGTTTATCATTGATACCCGCGATGCAGGCTATGGTGGGCTCAGCCTGTCCATTGAGGGCCCCAGCAAGG 
TGGACATCAACACAGAGGACCTGGAGGACGGGACGTGCAGGGTCACCTACTGCCCCACAGAGCCAGGCAA 
CTACATCATCAACATCAAGTTTGCCGACCAGCACGTGCCTGGCAGCCCCTTCTCTGTGAAGGTGACAGGC
GAGGGCCGGGTGAAAGAGAGCATCACCCGCAGGCGTCGGGCTCCTTCAGTGGCCAACGTTGGTAGTCATT 
GTGACCTCAGCCTGAAAATCCCTGAAATTAGCATCCAGGATATGACAGCCCAGGTGACCAGCCCATCGGG 
CAAGACCCATGAGGCCGAGATCGTGGAAGGGGAGAACCACACCTACTGCATCCGCTTTGTTCCCGCTGAG 
ATGGGCACACACACAGTCAGCGTCAAGTACAAGGGCCAGCACGTGCCTGGGAGCCCCTTCCAGTTCACCG 
TGGGGCCCCTAGGGGAAGGGGGAGCCCACAAGGTCCGAGCTGGGGGCCCTGGCCTGGAGAGAGCTGAAGC
TGGAGTGCCAGCCGAATTCAGTATCTGGACCCGGGAAGCTGGTGCTGGAGGCCTGGCCATTGCTGTCGAG 
GGCCCCAGCAAGGCTGAGATCTCTTTTGAGGACCGCAAGGACGGCTCCTGTGGTGTGGCTTATGTGGTCC 
AGGAGCCAGGTGACTACGAAGTCTCAGTCAAGTTCAACGAGGAACACATTCCCGACAGCCCCTTCGTGGT 
GCCTGTGGCTTCTCCGTCTGGCGACGCCCGCCGCCTCACTGTTTCTAGCCTTCAGGAGTCAGGGCTAAAG 

GTCAACCAGCCAGCCTCTTTTGCAGTCAGCCTGAACGGGGCCAAGGGGGCGATCGATGCCAAGGTGCACA
GCCCCTCAGGAGCCCTGGAGGAGTGCTATGTCACAGAAATTGACCAAGATAAGTATGCTGTGCGCTTCAT 
CCCTCGGGAGAATGGCGTTTACCTGATTGACGTCAAGTTCAACGGTACCCACATCCCTGGAAGCCCCTTC 
AAGATCCGAGTTGGGGAGCCTGGGCATGGAGGGGACCCAGGCTTGGTGTCTGCTTACGGAGCAGGTCTGG 
AAGGCGGTGTCACAGGGAACCCAGCTGAGTTCGTCGTGAACACGAGCAATGCGGGAGCTGGTGCCCTGTC 

GGTGACCATTGACGGCCCCTCCAAGGTGAAGATGGATTGCCAGGAGTGCCCTGAGGGCTACCGCGTCACC
TATACCCCCATGGCACCTGGCAGCTACCTCATCTCCATCAAGTACGGCGGCCCCTACCACATTGGGGGCA 
GCCCCTTCAAGGCCAAAGTCACAGGCCCCCGTCTCGTCAGCAACCACAGCCTCCACGAGACATCATCAGT 
GTTTGTAGACTCTCTGACCAAGGCCACCTGTGCCCCCCAGCATGGGGCCCCGGGTCCTGGGCCTGCTGAC 
GCCAGCAAGGTGGTGGCCAAGGGCCTGGGGCTGAGCAAGGCCTACGTAGGCCAGAAGAGCAGCTTCACAG 

TAGACTGCAGCAAAGCAGGCAACAACATGCTGCTGGTGGGGGTTCATGGCCCAAGGACCCCCTGCGAGGA
GATCCTGGTGAAGCACGTGGGCAGCCGGCTCTACAGCGTGTCCTACCTGCTCAAGGACAAGGGGGAGTAC 
ACACTGGTGGTCAAATGGGGGCACGAGCACATCCCAGGCAGCCCCTACCGCGTTGTGGTGCCCTGAGTCT 
GGGGCCCGTGCCAGCCGGCAGCCCCCAAGCCTGCCCCGCTACCCAAGCAGCCCCGCCCTCTTCCCCTCAA 
CCCCGGCCCAGGCCGCCCTGGCCGCCCGCCTGTCACTGCAGCTGCCCCTGCCCTGTGCCGTGCTGCGCTC 

ACCTGCCTCCCCAGCCAGCCGCTGACCTCTCGGCTTTCACTTGGGCAGAGGGAGCCATTTGGTGGCGCTG
CTTGTCTTCTTTGGTTCTGGGAGGGGTGAGGGATGGGG 

SEQ ID NO:87 

>gi|4503745|ref|NP_001447.1| filamin 1 (actin-binding protein-280); filamin
A, alpha (actin-binding protein-280); filamin 1; actin-binding protein-280
[Homo sapiens]

MSSSHSRAGQSAAGAAPGGGVDTRDAEMPATEKDLAEDAPW^ 
LSDGLRLIALLEVLSQKKMHRKHNQRPTFRQMQ 
IWTLILHYSISMPMWDEEEDEEAKKQT^ 
PDWDSWDASKPVTNAREAMQQADDWLGIPQVITPEEIVDP
PKICARAYGPGIEPTGNlvrVTKKRAEFTVETRSAGQGEVLV^ 
TGTHKVTVLFAGQHIAKSPFEVYVDKSQGDASKV^ 

QDPMGQKGTVEPQLEARGDSTYRCSYQPTMEGVHTVHVTFAGVPIPRSPYTVTVGQACNPSACRAVGRGL
QPKGVRVKETADFKVYTKGAGSGELKVTVKGPKGEERVKQKDLGDGVYGFEYYPMVPGTO 

IGRSPFEVKVGTECGNQKVRAWGPGLEGGVVGKSADFVVEAIGDDVGTLGFSVEGPSQAKIECDDKGDGS
CDVRYWPQEAGEYAVHVLCNSEDIRLSPFMADIRDAPQDFHPDRVKARGPGLEKTGVAVNKPAEFTVDAK
HGGKAPLRVQVQDNEGCPVEALVKDNGNGTYSCSYVPRKPVKHTAMVSWGGVSIPNSPFRVNVGAGSHPN
KVKVYGPGVAKTGLKAHEPTYFTVDCAEAGQGDVSIGIKCAPGVVGPAEADIDFDIIRNDNDTFTVKYTP
RGAGSYTIMVLFADQATPTSPIRVK^PSHDASKVKAEGPGLS 

FSGLTKGDAVRDVDIIDHHDNTYTVKYTPVQQGPVGVNVTYGGDPIPKSPFSVAVSPSLDLSKIKVSGLG
EKVDVGKDQEFTVKSKGAGGQGKVASKIVGPSGAAVPCKVEPGLGADNSVVRFLPREEGPYEVEVTYDGV
PVPGSPFPLEAVAPTKPSKVKAFGPGLQGGSAGSPARFTIDTKGAGTGGLGLTVEGPCEAQLECLDNGDG
TCSVSYVPTEPGDYNINILFADTHIPGSPFKAHVVPCFDASKVKCSGPGLERATAGEVGQFQVDCSSAGS
AELTIEICSEAGLPAEVYIQDHGDGTHTITYIPLCPGAYTVTIKYGGQPVPNFPSKLQVEPAVDTSGVQC 

YGPGIEGQGVFREATTEFSVDARALTQTGGPHVKARVANPSGNLTETYVQDRGDGMYKVEYTPYEEGLHS
VDVTYDGSPVPSSPFQVPVTEGCDPSRVRVHGPGIQSGTTNKPNKFTVETRGAGTGGLGLAVEGPSEAKM 
SCMDNKEGSCSVEYIPYEAGTYSLNVTYGGHQVPGSP 

FQVDTS KAGVAPLQVKVQGPKGLVE PVDWDNADGTQTVNYVPS REGPYS I S VLYGDEEVPRS PFKVKVL 
PTHDAS KVKAS GPGLNTTGVPAS LP VE FT IDAKDAGEGLLAVQ I TDPEGKPKKTHI QDNHDGTYTVAYVP 
60 DVTGRYTILIKYGGDEIPFSPYRVRAVPTGDASKCTVTV^ 

KGKVTCTVCTPDGSEVDVDWENEDGTFDIFYTAPQPGKYVICVRFGGEHVPNSPFQVTAIiAGDQPSVQP 
PLRSQQIiAPQYTYAQGGQQT^RpERPLVGVNGL

TDNKDGTVTVRYAPSEAGLHEMD IRYDNMHI PGS PLQF YVD YVNCGHVTAYGPGLTHG WNKPATFTVNT 
KDAGEGGLSLAIEGPSKAEISCTDNQDGTCSVSYLPVLPGDYSILVKYNEQHVPGSPFTARVTGDDSMRM 
SHLKVGSAADIPINISETDLSLLTATVVPPSGREEPCLLKRLRNGHVGISFVPKETGEH^ 
5 ASSPIPWISQSEIGDASRVRVSGQGLHEGHTFEPAEFIIDTRDAGYGGLSLSIEGPSKVDINTEDLEDG 
TCRVTYCPTEPGNYIINIKFADQHVPGSPFSVK\nTGEGRVKESITRRRRAPSVANV 
IQDMTAQVTSPSGKTHEAEIVEGENHTYCIRFVPAEMGTHTVSVKYKGQHVPGSPFQFW 
VRAGGPGLERAEAGVPAEFS I WTREAGAGGLAI AVEGPS KAE I S FEDRKDGS CGVAYWQEPGD YEVS VK 
FNEEHI PDS PFWPVAS PSGDARRLTVS S LQES GLKVNQPAS FAVS LNGAKGAIDAKVHS PSGALEEC YV 
10 TEIDQDKYAVRFI PRENGVYLIDVKFNGTHIPGS PFKIRVGEPGHGGDPGLVSAYGAGLEGGVTGNPAEF 
VVNTSNAGAGALSVTIDGPSKVKMDCQE 
kVSNHSLHETSSVFVDSLTKATCAPQHGAPGPGPADASKW 

LVGVHGPRTPCEE ILVKHVGSRLYS VS YLLKDKGEYTLWKWGHEHI PGS PYRVWP 

SEQ ID NO:88
Gα13 protein

>GL2_46_3_M13R Direction: sense

CCGCGAGGGCAGCGGCGGCGGCAAGATGGCGGACTTCCTGCCGTCGCGGTCCGTGCTGTCCGTGTGCTTCCCCGG 



GNCTGCCTGCTGACGAGTGGCGAGG 

SEQ ID NO:89 

>gi|5729847|ref|NM_006572.1| Homo sapiens guanine nucleotide binding
protein (G protein), alpha 13 (GNA13), mRNA

TGGGGCCGGAGAGGCGGCGAGGCGGCGGCGGCGGCGGCAAGATGGCGGACTTCCTGCCGTCGCGGTCCGT 

GCTGTCCGTGTGCTTCCCCGGCTGCCTGCTGACGAGTGGCGAGGCCGAGCAGCAACGCAAGTCCAAGGAG 

ATCGACAAATGCCTGTCTCGGGAAAAGACCTATGTGAAGCGGCTGGTGAAGATCCTGCTGCTGGGCGCGG 

GCGAGAGCGGCAAGTCCACCTTCCTGAAGCAGATGCGGATCATCCACGGGCAGGACTTCGACCAGCGCGC 

GCGCGAGGAGTTCCGCCCCACCATCTACAGCAACGTGATCAAAGGTATGAGGGTGCTGGTTGATGCTCGA 

GAGAAGCTTCATATTCCCTGGGGAGACAACTCAAACCAACAACATGGAGATAAGATGATGTCGTTTGATA 

CCCGGGCCCCCATGGCAGCCCAAGGAATGGTGGAAACAAGGGTTTTCTTACAATATCTTCCTGCTATAAG 

AGCATTATGGGCAGACAGCGGCATACAGAATGCCTATGACCGGCGTCGAGAATTTCAACTGGGTGAATCT 

GTAAAATATTTCCTGGATAACTTGGATAAACTTGGAGAACCAGATTATATTCCATCACAACAAGATATTC 

TGCTTGCCAGAAGACCCACCAAAGGCATCCATGAATACGACTTTGAAATAAAAAATGTTCCTTTCAAAAT 

GCTTGATGTAGGTGGTCAGAGATCAGAAAGGAAACGTTGGTTTGAATGTTTCGACAGTGTGACATCAATA 

CTTTTCCTTGTTTCCTCAAGTGAATTTGACCAGGTGCTTATGGAAGATCGACTGACCAATCGCCTTACAG 

AGTCTCTGAACATTTTTGAAACAATCGTCAATAACCGGGTTTTCAGCAATGTCTCCATAATTCTGTTCTT 

AAACAAGACAGACTTGCTTGAGGAGAAGGTGCAAATTGTGAGCATCAAAGACTATTTCCTAGAATTTGAA 

GGGGATCCCCACTGCTTAAGAGACGTCCAAAAATTCCTGGTGGAATGTTTCCGGAACAAACGCCGGGACC 

AGCAACAGAAGCCCTTATACCACCACTTCACCACTGCTATCAACACGGAGAACATCCGCCTTGTTTTCCG 

CGACGTGAAGGATACTATTCTGCATGACAACCTCAAGCAGCTTATGCTACAGTGATGTACAAAAGACTTG 

CTGTTTTAATATCTTTTTGTGTTTTTGATGTTTTCTGTTTGTTTTGTTTTTTAAAATAGCAGTTTACAAC 

CAGAATTAGAACAATCTTAATTCTACGTTTAACTTCTTGAAAATCTTAGTACTTTTTCTGCGGCCTTTGG 

TTTGTGGCTGAAAGCTGTTGAGTGACTCATCGCCAAGATTTGCTGTAATGCAGGCTTTGATCTGTTTCAC 
CC 

SEQ ID NO:90

>gi|5729848|ref|NP_006563.1| guanine nucleotide binding protein (G
protein), alpha 13; guanine nucleotide binding protein, alpha-13 [Homo 
sapiens] 

MADFLPSRSVLSVCFPGCLLTSGEAEQQRKSKEIDKCLSREKTYVKRLVKILLLGAGESGKSTFLKQMRI
IHGQDFDQRAREEFRPTIYSNVIKGMRVLVDAREKLHIPWGDNSNQQHGDKMMSFDTRAPMAAQGMVETR
VFLQYLPAIRALWADSGIQNAYDRRREFQLGESVKYFLDNLDKLGEPDYIPSQQDILLARRPTKGIHEYD
FEIKNVPFKMLDVGGQRSERKRWFECFDSVTSILFLVSSSEFDQVLMEDRLTNRLTESLNIFETIVNNRV
FSNVSIILFLNKTDLLEEKVQIVSIKDYFLEFEGDPHCLRDVQKFLVECFRNKRRDQQQKPLYHHFTTAI
NTENIRLVFRDVKDTILHDNLKQLMLQ
HOXB2

>GL3_11D_1_M13R Direction: sense

TTTGCCCTGACCCGGGCCATTGCCAGAAGACGTCTTCTCGGGGCGCCAGGATTCACCTTTCCTTCCCGACCTCAA 
5 CTTCTTCGCGGCCGACTCCTGTCTCCAGCTATCCGGAGGCCTCTCCCCTAGCCTACAGGGTTCTCTCGACAGCCC 
TGGTCCCTTTTTCCGATGGACAAAGAGCTTGGATTTTTTCACCAAGTACCGCTCTGTGCCCAATTCGGAGCCTGC 
NAGGTTTTACCCCCCCCTTTAAAAAAGCCCCCTTTGGTTATATACCCCTTACCCTTACCCCCCGTGGGTTACCCC 
TTNTATACGGAAACCCCCCCCTGGTCGGCCTTCCCCTTTATGGGGCCCCGGTCTTAAACCTTGGGGAAACAGAAN 
AACTTCCGGAAAGGGACCCCCATTTCGATTTACCCCGCAAAAGCCCCCCNCTTTTCCCAAANGGGATACCGGCCA 
10 AGTTNTAAAGGAAACCCTTTTAANATTTGGATTGAGGATTATTTATTATAGGGGCCCTATTAAAACAAACAACAA 
CACTTCTTTTTATCCACAAAGTGGGNGGGATNTAAATGATTTATCAAANCCCGTTTGTGGTAACAAAACAATTTT 
TTTACAAAACGNCCCGGGAGTATATTAGTTATAAAAAAAATATTCCTCCCCGAAAAACATTATTCTCCCCCCCCC 
CTTATATATTTTAACACCCCATATATTCAACNGCTATTTATTATTGCCACCATATTTTTACCCCATAAATA 

SEQ ID NO:92

>GL3_21C_1_M13F Direction: N/A 

TTTGCCCTGACCCGGGCCATTGCCAGAAGACGTCTTCTCGGGGCGCCAGGATTCACCTTTCCTTCCCGACCTCAA 

CTTCTTCGCGGCCGACTCCTGTCTCCAGCTATCCGGAGGCCTCTCCCCCTTAGCCTACAGGGTTCTCTCGACAGC 

CCGGTCCCGGTTTTTTATTTCCCCGGAAGGGGGAAGGGAAGGCTGGGGATTGTGTTTCGACGCAGTACGCTCTGT 

20 GCCATCGACCTGCAGTTTCCCTAACCTGTTTCCTCCTCCCGGTCCTTTCGACCCCCGCGCTCCTTGGCCGTCTAC 
TGGAA 

SEQ ID NO:93 

>GL3_11D_1_M13F Direction: anti-sense 

25 AAAANGTATAGACTCACTATAGTGGCGAATTGGGMCCATCTAGATGCATGCTCGAGCGGCCGCCAGGTGTGATGG 
ATATCTGCAGAATTGCTGCCGCTTACGACAGTCAGTGAGTATCAGACCTCAGAGCTAGATCGAGCGGCCGCTTTA 
TCTATCTAACCACTGTGCTGGGTTCGTGCGGNCCCCGCTAGAGTTTAATGTATTCCTGAGATTTCACTGGACAGG 
AGTCTACCAAACGGAATTTTTCTGTGTGAATTTTAAAAGATAACCGAGTGCCCAATATTTTAGAACGAAGAAGAA 
AGGGAGTGGATTAAACGCTAAGTTCAGTAATACCNTGAAGTTTTTAGCAAAAGCGACATAAGTTCTATGGCGACT 

30 GAGGGGTGGGAGAGGCTCGACGNNTTTNACCAAGTAGACGGGCCAAGGAAGGCGCGGNGGGGTCCGGAAACAMGG 
GACCCGGGGCAGGGGGAGGGGAAAAACCAGGGTTACAGCGNGAAAAACCTGGCCAAGGGACTACCCGGAACGTAT 
GAGGGCCAAACAAAAGAAAGGCCGGATTAACCCTATGGGGGATTGGAAAAAAAAAATCCCCAAAGGGCTTCCTAT 
ATCCCCCTCCGGGGGAAAAAAACACAGGGGGGGAAACCCCCGGGAGGCCCTATGGATCCAGGACACGAAAGCGAA 
AACCCCCCTGGGTAAGAGAGCCCTAAGGGGGGGA 


SEQ ID NO:94

>GL3_27D_2_M13R Direction: anti-sense 

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAAACGGAATTTTTCTGTGTGAATTTT 
AAAAGATAACCGAGTGCCCAATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 

40 AATTTTAGCAAAACACATAAGTCTATGCGACTGAGGGTGGGAGAGGCTCGATTTTTCCAGTAGACGGCCAAGGAG 
CGCGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTTAGGGAAACTGCAGGTCGATGGCACAGAGCGTACTGGTG 
AAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGCTGTCGAGAGAACCCTGTAGGCTAGGGGAGAGGCCTCCG 
GATAGCTGGAGACAGGAGTCGGCCGCGAAGAAGTTGAGGTCGGGAAGGAAAGGTGAATCCTGGCGCCCCGAGAAG 
ACGTCTTCTGGCAATGGCCCGGGCTCCAGCCCGACGGCCCCGCGCAGCGCGCAAGCCGGGACTTCGAACGCACNC 

45 NTGCAGCCCTCATAAGCGAACGGCATAAAAGGCCCGGGGGGTCCAGCGCATTAAGGGCCCCCCGGACCATCCTCC 
GGGCGAGGTGAACAAGCGGGGTTACCCAAGGCCGGCGGAGAGGTCCCGAGGCCATTNGACTAGGGNT 

SEQ ID NO:95 

>GL3_45B_3_M13F Direction: N/A 

50 TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAAAAGCAATGGCCTGGGACTGC 
CGGAGGCTGGTGGCGGCGGGGCGCGCAGGCTGCGCACGGGTTTACACCAACACGCAGCTGCTGGAACTGGAGAAG 
GAATTCCACTTTAATAAGTACCTGTGCCGGCCACGCCGCGTCGAGATCGCGGCCTTGCTGGACCTCACCGAAAGG 
CAGGTCAAAGTCTGGTTTCAGAACCGGCGCATGAAGCACAAGCGGCAGACGCAGCACCGAGAGCCGCCGGATGGG 
GAGCCTGCCTGCCCGGGAGCCCTGGAGGACATCTGCGACCCTGCCGAGGAACCCTGCGGCCAGCCCGGGCGGCCC 

55 CTCCGCCTCGCGGGCGGCGTGGGAAGCCTGCTGTCACCCGCCGGAGGTGGTTGCCGGGGGCCC 

SEQ ID NO:96 

>GL3_27D_1_M13R Direction: anti-sense

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAAACGGAATTTTTCTGTGTGAATTTT 
60 AAAAGATAACCGAGTGCCCAATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 
AATTTTAGCAAAAC^CATAAGTCT^
CGTGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTTAGGGAAACTGCAGGTCGATGGCACAGAGCGTACTGGTG 
AAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGCTGTCGAGAGAACCCTGTAGGCTAGGGGAGAGGCCTCCG 
GATAGCTGGAGACAGGAGTCGGCCGCGAAGAAGTTGAGGTCGGGAAGGAAAGGTAATTCCTTCTCCAGTTCCAGC 
AGCTGCGTGTTGGTGTAAGCCGTG 

SEQ ID NO:97 

>GL3_45B_1_M13R Direction: sense 

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAAAAGCAATGGCCTGGGACTGC 
CGGAGGCTGGTGGCGGCGGGGCGCGCAGGCTGCGCACGGTTACACCAACACGCAGCTGCTGGAACTGGAGAAGGA 
ATTCCACTTTAATAAGTACCTGTGCCGGCCACGCCGCGTCGAGATCGCGGCCTTGCTGGACCTCACCGAAAGGCA 
GGTCAAAGTCTGGTTTCAGAACCGGCGCATGAAGCACAAGCGGCAGACGCAGCACCGAGAGCCGCCGGATGGGGA 
GCCTTGCCTTGCCCGGGAGCCCTTGGAGGACATCTGCGACCCTGCCGAGGAACCCGCGGGCCAGCCCGGGCGGCC 
CCTTCCGCCTCGCGGGCGGGCG 

SEQ ID NO:98 

>GL3_11D_3_M13R Direction: anti-sense

CCCCCCTAGAGATTCTAATGTACTTCCTGAGATTATCACTGGAAGGAGATCTACCAAACGGAATTTCTTCCGTGT
GAATTTTAACAGAGATAACCGAGTGCCCATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGT 
AATACCTGAATTTTAGCAAAGACACATAAGTCTATGCGACTGAGGGTGGGAGAGGCTCGATTTTTCCAGTAGACG 
GCCAAGGAGCGCGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTTAGGGAAACTGCAGGTCGACTGGCACAGAG 
CGTACTGGTGAAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGCTGTCGAGAGAACCCTGGTAGGCTAGGGG 
AGAGGCCTCCGGATAGCTGGAGACAGGAGTCGGCCGCGAANGAAGTTGAGGTCGGGAAGGAAAGGTGAATCC 

SEQ ID NO:99 

>GL3_11D_4_M13F Direction: sense

AAATTGGAAATAGACTCACTATAGGGCGAATTGGGCCCTCTAGAATGACATGCTCGAGCGGCCAGNCCAAGGTGT 
GATGGATATCTGCAGAATTCTGCCCTTACGACAGTCAGTGAGTATCAGACCTCAGAGCTAGATCGTAGCGGCCGC 
TTTATCTATCTAACCACTGTGCTGGGTNTCTGCGGCCCCCGCTAGAGTTTAATTATTCCTGAGATTTCACTGGCC 
AGGGAGTCTACCAACAGGGAATTTTTCTGTGTGAATTTTAAAAGATAACCGAGTGCCCAATATTTTAGAAGAAGA 
AGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTGAATTTTAGCAAAACACATAAGTCTATGCGACTGAGG 
GTGGGAGAGGCTCGATTTTTCCAGTAGACGGCCAAGGAGCGCGGGGGNCGAAAGGACCGGGAGGAGGAAACAGGT 
TAGGGAAACTGCAGGTCGATGGCACAGAGCGTACTGGTGAAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGG 
GTGTAAGAGAGAACCCTGGNAAGGCCAGCGGGA 

SEQ ID NO:100

>GL3_41D_1_M13R Direction: anti-sense

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAGACGGAATTTTTCTGTGTGAATTTT 
AAGAAGATAACCGAGTGCCCAATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCT 
GAATTTTAGCAAAACACATAACGTCTATGCGACTGAGGGTGGGAGAGGCTCGATTTTGTCCAGTACGACGGCCAA 
GGAGCGCGCGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTTAGGGAAACTGCAGGTCGATGGCACAGAGCGTAC 
TGGTGAAAAAATCCAGCTCTACCTCGGAAAAAGGGACCGGGACCNGATCGAGAGAACCCTGTAGGCTAGGGGAGA 
GGCCATCCGGATAGACTGGAGACAGGAGTCGGAACGCGAAGAAGTACGAGGTCGNGGAAGG 



>GL3_41D_3_M13R Direction: anti-sense

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAAACGGAATTTTTCTGTGTGAATTTT 
AAAAGATAACCGAGTGCCCAATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 
AATTTTAGCAAAACACATAAGTCTATGCGACTGAGGGTGGGAGGGGCTCGATTTTTCCAGTAGACGGCCAAGGAG 
CGCGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTNAGGGAAACTGCAGGTCGATGGCACAGAGCGTACTGGTG 
AAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGCTGTCGAGAGAACCCTGTAGGCTAGGGGAGAGGCCTCCG 
GATAGCTGGAGACAGGAGTCGGCCGCGAAGAAGTNGAGGTCGGGAAGGAAAGCGTGAAAGTCCCTGGCACAGCCA 
CCGAAGGCAAAGAACGGATCNTTTCATNGGACAAAAATGGGACCCAGGAGCATCCCAAGCACCCGGANCGGGGCG 
CCCCGGACGCANGCAGNAGCGCAAAGACAGGGNGAACTCACGAAAGGCAGNACNTGACGACCCCATTCCAAAAAG 
GAGAGAACAANGGGNATAAAAAGGGACCAGGAGAAGANCAAG 

SEQ ID NO:102

>GL3_41D_4_M13R Direction: anti-sense

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAAACGGAATTTTTCTGTGTGAATTTT 
AAAAGATAACCGAGTGCCCT^TATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 



SEQ ID NO:101
aattttagcaaaacacatj^tctatgcgactgagggtgggagaggctcgatttWccagtagacggccaaggag
cgcgggggtcgaaaggaccgggaggaggaaacaggttagggaaactgcaggtcgatggcacagagcgtactggtg 
aaaaaatccagctcttcctcggaaaaagggaccgggctgtcgagagaaccctgtaggctaggggagaggcctccg 
gatagctggagacaggagtcggccgcgaagaagttcgaggtc 

SEQ ID NO:103

>GL3_45B_1_M13F Direction: anti-sense

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAAACGGAATTTTTCTGTGTGAATTTT 

AAAAGATAACCGAGTGCCCAATATTTTAGAAGAGGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 

10 AATTTTAGCAAAACACGTAAGTCTATGCGACTGAGGGTGGGAGAGGCTCGATTTTTCCAGTAGACGGCCAAGGAG 

CGCGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTTAGGGAAACTGCAGGTCGATGGCACAGAGCGTACTGGTG 

AAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGCTGTCGAGAGAACCCTGTAGGCTAGGGGAGAGGCCTCCG 

GATAGCTGGAGACAGGAGTCGGCCGCGAAGAAGTTGAGGTCGGGAAGGAAAGGTGAATCCTGGCGCCCCGAGAAG 
AC 


SEQ ID NO:104

>GL3_45B_3_M13R Direction: anti-sense 

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAAACGGAATTTTTCTGTGTGAATTTT 

AAAAGATAACCGAGTGCCCAATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 

20 AATTTTAGCAAAACACATAAGTCTATGCGACTGAGGGTGGGAGAGGCTCGATTTNTCCAGTAGACGGCCAAGGAG 

CGCGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTAGGGAAACTGCAGGTCGATGGCACAGAGCGTACTGGTGA 

AAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGACTGTCGAGAGAACCCTGNAGGCTAGGGGAGAGGCCTCCG 

GATAGCTGGAGACAGGAGTCGGACCGCGAAGCAAGTCTGAGGTCGGGAAGGAAAGGTGAATCCTGGCGCCCCGAG 
AAGACGTCTTCT 


SEQ ID NO:105

>GL3_45B_4_M13R Direction: anti-sense 

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAAACGGAATTTTTCTGTGTGAATTTT 
AAAAGATAACCGAGTGCCCAATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 
30 AATTTTAGCAAAACACATAAGTCTATGCGACTGAGGGTGGGAGAGGCTCGATTTTTCCAGTAGACGGCCAAGGAG 
CGCGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTTAGGGAAACTGCAGGTCGANGGCACAGAGCGTACTGGTG 
AAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGCATGTCGAGAGAACCCTGGANGGCTAGGGGAGAGGCCTA 
CCGGATAGCNGGAGACAAGGAGNCGGACG 

SEQ ID NO:106

>GL3_27D_3_M13F Direction: N/A

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCATTGCCAGAAGACGTCTTCTCG 

GGGCGCCAGGATTCACCTTTCCTTCCCGACCTCAACTTCTTCGCGGCCGACTCCTGTCTCCAGCTATCCGGAGGC 

CTCTCCCCTAGCCTACAGGGTTCTCTCGACAGCCCGGTCCCTTTTTCCGAGGAAGAGCTGGATTTTTTCACCAGT 

40 ACGCTCTGTGCCATCGACCTGCAGTTTCCCTAACCTGTTTCCTCCTCCCGGTCCTTTCGACCCCCGCGCTCCTTG 

GCCGTCTACTGGAAAAATCGAGCCTCTCCCACCCTCAGTCGCATAGACTTATGTGTTTTGCTAAAATTCAGGTAT 

TACTGAATTAGCGTTTAATCCACTCCCTTTCTTCTTCTTCTAAAATATTGGGCACTCGGTTATCTTTTANAAATT 

TCACACAGAAAATTCCGTTTGGTAGACTCCTTCCAGTGAAATCTCAGGAATTATTAACTCTAGGGGGGCCGCAAG 
A 


SEQ ID NO:107

>GL3_27D_3_M13R Direction: anti-sense

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAAACGGAATTTTTCTGTGTGAATTTT 
AAAAGATAACCGAGTGCCCAATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 

50 AATTTTAGCAAAACACATAAGTCTATGCGACTGAGGGTGGGAGAGGCTCGATTTTTCCAGTAGACGGCCAAGGAG 
CGCGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTTAGGGAAACTGCAGGTCGATGGCACAGAGCGTACTGGTG 
AAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGCTGTCGAGAGAACCCTGTAGGCTAGGGGAGAGGCCTCCG 
GATAGCTGGAGACAGGAGTCGGCCGCGAAGAAGTTGAGGTCGGGAAGGAAAGGTGAATCCTGGCGCCCCGAGAAG 
ACGTCTTCTGGCAATGGGTTTTCTTCTCTCCCTCTCTAGTCTACAGCCCCGGCCCGGGTCAGGGCAAACCGGAGA 

5 5 ACCCAGCACACTGGAGCCACCGNGTGCCACCGGCTTGG 

SEQ ID NO:108

>GL3_27D_4_M13F Direction: N/A

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCATTGCCAGAAGACGTCTTCTCG 
60 GGGCGCCAGGATTCACCTTTCCTTCCCGACCTCAACTTCTTCGCGGCCGACTCCTGTCTCCAGCTATCCGGAGGC 
CTCTCCCCTAGCCTAC^GGGTTCTCTCGACAGCCCGGTCCCTTTTTCCGAGGAAGA^TGGATTTTTTCACCAGT
ACGCTCTGTGCCATCGACCGTGCAGTTTCCCTAACCTGTTTCCTCCTCCCGGTCCTTTCGACCCCCGCGCTCCTT 
GGCCGTCTACTGGAAAAATCGAGCCTCTCCCACCCTCAGTCGCATAGACTTATGTGTTTTGCTAAAATTCAGGTA 
TTACTGAATTAGCGTTTAATCCACTCCCTTTCTTCTTCTTCTAAAATATTGGGCACATCGGTTATCTTTTTAAAA 
TTCACACAGAAAAATTCCGTTAGGG 



>GL3_27D_4_M13R Direction: anti-sense
CCCCCCTAGAGTTTAATTATTCCTG 

AAAAGATAACCGAGTGCCCAATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 
AATTTTAGCAAAACACATAAGTCTATGCGACTGAGGGTGGGAGAGGCTCGATTTTTCCAGTAGACGGCCAAGGAG 
CGCGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTTAGGGAAACTGCAGGTCGATGGCACAGAGCGTACTGGTG 
AAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGCTGTCGAGAGAACCCTGTAGGCTAGGGGAGAGGCCTCCG 
GATAGCTGGAGACAGGAGTCGGCCGCGAAGAAGTTGAGGTCGGGAAGGAAAGGTGAATCCTGGCGCCCCGAGAAG 
ACGTCTTCTGGCAATGGGTTTTCTTCTCTCCCTCTCTAGTCTACAGCCCGTGGCCGGGGTCAGGGCAAACCGGAG 
AAACCAGCACACTGGAGCCA 



>GL3_21C_1_M13R Direction: anti-sense

CCCCCCTAGAGTTTAATTATTCCTGAGATTTCACTGGAAGGAGTCTACCAAACGGAATTTTTCTGTGTGAATTTT 
AAAAGATAACCGAGTGCCCAATATTTTAGAAGAAGAAGAAAGGGAGTGGATTAAACGCTAATTCAGTAATACCTG 
AATTTTAGCAAAACACATAAGTCTATGCGACTGAGGGTGGGAGAGGCTCGATTTTTCCAGTAGACGGCCAAGGAG 
CGCGGGGGTCGAAAGGACCGGGAGGAGGAAACAGGTTAGGGAAACTGCAGGTCGATGGCACAGAGCGTACTGGTG 
AAAAAATCCAGCTCTTCCTCGGAAAAAGGGACCGGGCTGTCGAGAGAACCCTGTAGGCTAGGGGAGAGGCCTCCG 
GATAGCTGGAGACAGGAGTCGGCCGCGAAGAAGTTGAGGTCGGGAAGGAAAGGTGAATCCTGGGCGCCCCGAGAA 
GACGTCTTCTAGGCAATGGCCCGGGTCAGGGCAAA 

SEQ ID NO:111

>GL3_41D_3_M13F Direction: N/A 

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGGAAACCCAAAAGCAATGGCCTGGGACTGC 
CGGAGGCTGGTGGCGGCGGGGCGCGCAGGCTGCGCACGGTTTACACCAACACGCAGCTGCTGGAACTGGAGAAGG 
AATTCCACTTTAATAAGTACCTGTGCCGGCCACGCCGCGTCGAGATCGCGGCCTTGCTGGACCTCACCGAAAGGC 
AGGTCAAAGTCTGGTTTCAGAACCGGCGCATGAAGCACAAGCGGCAGACGCAGCACCGAGAGCCGCCGGATGGGG 
AGCCTGCCTGCCCGGGAGCCCTGGAGGACATCTGCGACCCTGCCGAGGAACCCGCGGCCAGCCCGGGCGGCCCCT 
TCCGCCTCGCGGGCGGCGTGGGAAGCCTGCTGTCACCCGCCGGAGGTGGTGCCGGGGGCCTTAAGCGCGGACCCC 
GGCCTTTAGCCGTTCGC 

SEQ ID NO:112

>GL3_41D_4_M13F Direction: N/A 

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAAAAGCAATGGCCTGGGACTGC 
CGGAGGCTGGTGGCGGCGGGGCGCGCAGGCTGCGCACGGCTTACACCAACACGCAGCTGCTGGAACTGGAGAAGG 
AATTCCACTTTAATAAGTACCTGTGCCGGCCACGCCGCGTCGAGATCGCGGCCTTGCTGGACCTCACCGAAAGGC 
AGGTCAAAGTCTGGTTTCAGAACCGGCGCATGAAGCACAAGCGGCAGACGCAGCACCGAGAGCCGCCGGATGGGG 
AGCCATGCCTGCCCGGGAGCCCTTGGAGGACATCTGCGACCCTTGCCGAGGAACCCGCGGGCCAGCCCGGNNGCG 
GCCCCATCCGCCTCG 



>GL3_41D_PCR_G3F1 Direction: N/A 

TTTGCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAAAAGCAATGGCCTGGGACTGCC 
GGAGGCTGGTGGCGGCGGGGCGCGCAGGCTGCGCACGGCTTACACCAACACGCAATTCTGGAACTGGAGAAGGAA 
TTCCACTTTAATAAGTACCTGTGCCGGCCACGCCGCGTCGAGATCGCGGCCTTGCTGGACCTCACCGAAAGGCAG 
GTCAAAGTCTGGTTTCAGAACCGGCGCATGAAGCACAAGCGGCAGACGCAGCACCGAGAGCCGCCGGATGGGGAG 
CCTGCCTGCCCGGGAGCCCTGGAGGACATCTGCGACCCTGCCGAGGAACCCGCGGCCAGCCCGGGCGGCCCCTCC 
GCCTCGCGGGCGGCGTGGGAAGCCTGCTGTCACCCGCCGGAGGTGGTGCCGGGGGCCTTAAGCGCGGACCCCCGG 
CCTTTAGCCGTTCGCTTAGAGGGCGCAGGCGCGTCGAGTCCCGGCTGCGCGCTGCGCGGGGCCGGCGGGCTGGAG 
CCCGGGCCATTGCCAGAAGACGTCTTCTCGGGGCGCCAGGATTCACCTTTCCTTCCCGACCTCAACTTCTTCGCG 
GCCGACTCCTGTCTCCAGOTATCCGAGGCCTCTCCCCTAGCCTACAGGGGTTCTCTCGACAGCCCGGTCCCTTTT 
TCCGAGGAAGAGCTGGATTTTTTCACCAGTACGCTCTGTGCCATCGAC 



SEQ ID NO:109



SEQ ID NO:110



SEQ ID NO:113
>GL3_41E_PCR_G3F1 Direction: sense

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAAAAGCAATGGCCTGGGACTGC 
CGGAGGCTGGTGGCGGCGGGGCGCGCAGGCTGCGCACGGCTTACACCAACACGCAGCTCTGGAACTGGAGAAGGA 
5 ATTCCACTTTAATAAGTACCTGTGCCGGCCACGCCGCGTCGAGATCGCGGCCTTGCTGGACCTCACCGAAAGGCA 
GGTCAAAGTCTGGTTTCAGAACCGGCGCATGAAGCACAAGCGGCAGACGCAGCACCGAGAGCCGCCGGATGGGGA 
GCCTGCCTGCCCGGGAGCCCTGGAGGACATCTGCGACCCTGCCGAGGAACCCGCGGCCAGCCCGGGCGGCCCCTC 
CGCCTCGCGGGCGGCGTGGGAAGCCTGCTGTCACCCGCCGGAGGTGGTGCCGGGGGCCTTAAGCGCGGACCCCCG 
GCCTTTAGCCGTTCGCTTAGAGGGCGCAGGCGCGTCGAGTCCCGGCTGCGCGCTGCGCGGGGCCGGCGGGCTGGA 
GCCCGGGCCATTGCCAGAAGACGTCTT

SEQ ID NO:115

>GL3_45B_4_M13F Direction: N/A 

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAAAAGCAATGGCCTGGGACTGC 
15 CGGAGGCTGGTGGCGGCGGGGCGCGCAGGCTGCGCACGGCTTACACCAACACGCAGCTGCTGGAACTGGAGAAGG 
AATTCCACTTTAATAAGTACCTGTGCCGGCCACGCCGCGTCGAGATCGCGGCCTTGCTGGACCTCACCGAAAGGC 
AGGTCAAAGTCTGGTTTCAGAACCGGCGCATGAAGCACAAGCGGCAGACGCAGCACCGAGAGCCGCCGGATGGGG 
AGCCTGCCTGCCCGGGAGCCCTGGAGGACATCTGCGACCCCTTGCCGAGGAACCCGCGGCCAGCCCGGGCGGCCC 

SEQ ID NO:116

>GL3_45B_PCR_G3F1 Direction: N/A 

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAAAAGCAATGGCCTGGGACTGC 
CGGAGGCTGGTGGCGGCGGGGCGCGCAGGCTGCGCACGGCTTACACCAACACGCAGCTGCTGGAACTGGAGAAGG 
AATTCCACTTTAATAAGTACCTGTGCCGGCCACGCCGCGTCGAGATCGCGGCCTTGCTGGACCTCACCGAAAGGC 


AGGTCAAAGTCTGGTTTCAGAACCGGCGCATGAAGCACAAGCGGCAGACGCAGCACCGAGAGCCGCCGGATGGGG 
AGCCTGCCTGCCCGGGAGCCCTGGAGGACATCTGCGACCCTGCCGAGGAACCCGCGGCCAGCCCGGGCGGCCCCT 
CCGCCTCGCGGGCGGCGTGGGAAGCCTGCTGTCACCCGCCGGAGGTGGTGCCGGGGGCCTTAAGCGCGGACCCCC 
GGCCTTTAGCCGTTCGCTTAGAGGGCGCAGGCGCGTCGAGTCCCGGCTGCGCGCTGCGCGGGGCCGGCGGGCTGG 
AGCCCGGGCCATTGCCAGAAGACGTCTTCTCGGGGCG 


SEQ ID NO:117

>GL3_27D_1_M13F Direction: N/A 

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAGAAGCAATGGCCTGGGACTGC 
CGGAGGCCTTGGTTGGCGGCGGGGCGCGCAGGCTTGCGGCCAACGGTTTTACACCAAACACGCAGCTGCTTGGAA 


CTGGAGAAAGGAAATTAACCCTTTTTCCTTTCCCGAACCTTCAAACTTTCTTTCGGCGGGCCGGACTTCCTTGTC 
TCCAGCTTATCCGGAGAGCGCCTTCTCCCCATTAGCCCTAACAAGGGGTTCTCTCNGGACAGGCCCGCGGGGTCC 
CCTTTTTTCCGGAGAAAGAGCTTGAATTTTTCCAACCAAGGTACGGCCCATTCGTTTGGTTGCCCCCAATTTCGT 
AGAACCCTTTGCACAAGGTTTTTTCCCCCTATAAAAACCCTGTGGTTTTTCCCCTTCCACCTTTCCCCGCGGGGG 
TATCCCTTTTTTCAGGAAGCCGCGCCNACAAAAGGGCCTTCCTATTTTGGGCCCGGTGTACTCTTTAGCGTGTGG 
40 GAAAACAGAATTCGTCGGGAAGGCCCTTTCTTCCACAAACCCCCCATTCNAGAGGTCCGGGGCAAGATTTAGAAA 
CTTTATTGGACTATGATTTGTGTTGAGGGCCCTTAAACACAAAATTTTCAAGGGGGATTATTTTTA 

SEQ ID NO:118

>GL3_27D_2_M13F Direction: N/A 

45 TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAAAAGCAATGGCCTGGGCACTG 
CCTGGCGCTTTGGGAAGCGTGCTGTCACCCGTCCGGAGGGTGGATGCCGTGGGGTCCTATGAAGCTGCGGACCCC 
CGGTCTTTAGACCGTACGCTTAGAGCGGCGGACAGGCNCGTCGAGGTCCCGGCTGCGAACAGCGCGGAGCCCGGG 
CNGCATGAAGCCTCGTGTCNCAATTTNCCAAGAATGGAAGGGTTCTTACCCTTCNGAGGCGCGGCCCAAGGNGTA 
CTTCTTACCCCCTGTGTTTCCCTTTTCCCCGAAACCCTCTCGAAGTCTTTCCTTTACGGGCGNGGCCAGANACTC 

50 CGTTGTTCCTACCCTANGGATTACANCCTGAGAAGACCTTTTCNCACGCCTCNTTGAGGCCTCACTGTAAACNGG 
TGGGAGGGTTNGTCTCCGTTCTTNCGGGGACCTAAGCGGCCCCGCTGGGCGAACCGCGGTTTTTTTTAACCCTCA 
CCGGGAGAAAGNAGACAAAANGACAAAGGGTCTTTTGAGGAAGATTCCATCTTTNCACNACCCGTATAGATTAGT 
TAACGGGTCTCCCTTGTGCTTCGCCCACATATTTAAGTACCGGCTGTGGGCGAAGCGTTCTTGGCCCNCCAATTA 

CACAGTCCNCTTTGTGATATAACCCNCTTTCCACTCTTACCCGNGCGGGGGACTCCCACTTTTTTTTC

SEQ ID NO:119

>GL3_41D_1_M13F Direction: N/A

TTTGCCCTGACCCGGGCCGGGGCTGTAGACTAGAGAGGGAGAGAAGAAAACCCAAAAGCAATGGCCTGGGACTGC 
CGGAGGCTGGTGGCGGCGGGGCGCGCAGGCTGCGCNACGGTTACACTCAACTACGCAGCTGCTGGAACTTGGAGA 
60 AGGAATTTCCACTTTAATAAGTACCTTGTGTCCGGGCCACAGCCGGCGGTCAGAGATCNGCGGGCCTTTGCCTGG 
ACCCTTCAGCCCGAAAAGGGCAAGGTCNAAGAGGCTCATAGGGTGTCC^GAGACn^CGGNGGCGGCCAATTGAAGA
GGCAACGAGAGGCCGGCCAAGGAACGGCAGCAACCCGGAAGAAGGCCCGGCCGCNGGCATTGGGGGGGAAGCCTT
NGCCCTTGGCCCCGGTGGTGAAGGTCCGCTTTGGGAANGGGAGCATATTCCTNGCGGAAACGCCTTGTCNCAGAA 
GGGTAGAAACCCCGTGACGGGGCCAAGGGCGCCCGGGGGCCGGCGGCCCCCCCTTTCCGGACCCATTTCCGGGCC 
5 GGGGNGCCGGGNGCAGTTG 

SEQ ID NO:120

>gi|4504464|ref|NM_002145.1| Homo sapiens homeo box B2 (HOXB2), mRNA
GAATTCCGGGGGGGGGTCCCTTCCGATCCTCCCTCCTGACGCCCCCCCCAGCAGCCCCCTCCCCCACCAT 

10 TGAAAGCCATGAATTTTGAATTTGAGAGGGAGATTGGGTTTATAAACAGCCAGCCGTCGCTCGCCGAGTG 
TCTGACTTCCTTCCCCGCTGTCTTGGAGACATTTCAAACTTCATCAATCAAGGAGTCGACATTAATTCCT 
CCTCCTCCTCCTTTCGAGCAAACCTTCCCCAGCCTCCAGCCCGGCGCCTCCACCCTTCAGAGACCCAGGA 
GCCAAAAGCGAGCCGAAGATGGGCCTGCTCTGCCGCCGCCACCGCCGCCGCCACTCCCCGCTGCCCCCCC 
GGCCCCCGAGTTCCCTTGGATGAAAGAGAAGAAATCCGCCAAGAAACCCAGCCAATCCGCCACGTCTCCT 

15 TCTCCGGCCGCCTCCGCCGTTCCGGCCTCCGGGGTCGGATCGCCTGCAGATGGCCTGGGACTGCCGGAGG 
CTGGTGGCGGCGGGGCGCGCAGGCTGCGCACGGCTTACACCAACACGCAGCTGCTGGAACTGGAGAAGGA 
ATTCCACTTTAATAAGTACCTGTGCCGGCCACGCCGCGTCGAGATCGCGGCCTTGCTGGACCTCACCGAA 
AGGCAGGTCAAAGTCTGGTTTCAGAACCGGCGCATGAAGCACAAGCGGCAGACGCAGCACCGAGAGCCGC 
CGGATGGGGAGCCTGCCTGCCCGGGAGCCCTGGAGGACATCTGCGACCCTGCCGAGGAACCCGCGGCCAG 

20 CCCGGGCGGCCCCTCCGCCTCGCGGGCGGCGTGGGAAGCCTGCTGTCACCCGCCGGAGGTGGTGCCGGGG 
GCCTTAAGCGCGGACCCCCGGCCTTTAGCCGTTCGCTTAGAGGGCGCAGGCGCGTCGAGTCCCGGCTGCG 
CGCTGCGCGGGGCCGGCGGGCTGGAGCCCGGGCCATTGCCAGAAGACGTCTTCTCGGGGCGCCAGGATTC 
ACCTTTCCTTCCCGACCTCAACTTCTTCGCGGCCGACTCCTGTCTCCAGCTATCCGGAGGCCTCTCCCCT 
AGCCTACAGGGTTCTCTCGACAGCCCGGTCCCTTTTTCCGAGGAAGAGCTGGATTTTTTCACCAGTACGC 

25 TCTGTGCCATCGACCTGCAGTTTCCCTAACCTGTTTCCTCCTCCCGGTCCTTTCGACCCCCGCGCTCCTT 
GGCCGTCTACTGGAAAAATCGAGCCTCTCCCACCCTCAGTCGCATAGACTTATGTGTTTTGCTAAAATTC 
AGGTATTACTGAATTAGCGTTTAATCCACTTCCTTTCTTCTTCTTCTAAAATATTGGGCACTCGGTTATC 
TTTTAAAATTCACACAGAAAAATTCCGTTTGGTAGACTCCTTCCAATGAAATCTCAGGAATAATTAAACT 
CTAGGGGGACTTTCTTAAAAATAACTAGAGGGACCTATTTTCCTCTTTTTTATGTTTTAGACTGTAGATT 

ATTTATTAAAATTCTTTAATAATAGGAAAAGGGGAAAGTATTTATTGTAC

SEQ ID NO:121

>gi|4504465|ref|NP_002136.1| homeo box B2; Homeo box-B2 [Homo sapiens]
MNFEFEREIGFINSQPSLAECLTSFPAVLETFQTSSIKESTLIPPPPPFEQTFPSLQPGASTLQRPRSQK 
RAEDGPALPPPPPPPLPAAPPAPEFPWMKEKKSAKKPSQSATSPSPAASAVPASGVGSPADGLGLPEAGG
GGARRLRTAYTNTQLLELEKEFHFNKYLCRPRRVEIAALLDLTERQVKVWFQNRRMKHKRQTQHREPPDG
EPACPGALEDICDPAEEPAASPGGPSASRAAWEACCHPPEVVPGALSADPRPLAVRLEGAGASSPGCALR
GAGGLEPGPLPEDVFSGRQDSPFLPDLNFFAADSCLQLSGGLSPSLQGSLDSPVPFSEEELDFFTSTLCA 
IDLQFP 



SEQ ID NO:122

HSPA5 

>GL2-67-2M13R Direction: N/A 

CCGATGAGCCGCTTGGCGTCAAAGACCGTGTTCTCGGGGTTGGAGGTGAGCTGGTTCTTGGCGGCATCGCCAATC 
45 AGACGTTCCCCTTCAGGAGTGAAGGCGACATAGGACGGCGTGATGCGGTTGCCCTGATCGTTGGCGATGATCTCC 
ACGCGGCCGCTCGCGATCTAGAAC 

SEQ ID NO:123

>GL2-67-5M13F Direction: N/A
50 GTTCTAGATCGCGAGCGGCCGCGTGGAGATCATCGCCAACGATCAGGGCAACCGCATCACGCCGTCCTATGTCGC 
CTTCACTCCTGAAGGGGAACGTCTGATTGGCGATGCCGCCAAGAACCAGCTCACCTCCAACCCCGAGAACACGGT 
CTTTGACGCCAGCGGCTCATCGG 

SEQ ID NO:124

>GL2-67-5M13R Direction: N/A

CCGATGAGCCGCTTGGCGTCAAAGACCGTGTTCTCGGGGTTGGAGGTGAGCTGGTTCTTGGCGGCATCGCCAATC 
AGACGTTCCCCTTCAGGAGTGAAGGCGACATAGGACGGCGTGATGCGGTTGCCCTGATCGTTGGCGATGATCTCC 
ACGCGGCCGCTCGCGATCTAGAAC 
SEQ ID NO:125



>GL2_1_A03_G3F1 Direction: sense 

GTTCTAGATCGCGAGCGGCCGCGTGGAGATCATCGCCAACGATCAGTGGCAACCGCATCACGCCGTCCTATGTCG 
CCTTCACTCCTGAAGGGGAACGTCTGATTGGCGATGCCGCCAAGAACCAGCTCACCTCCAAACCCGAGAACACGG 
5 TCTTTGACGCCAAGCGGCTCATCTGG 

SEQ ID NO:126

>GL2_1_A05_G3F1 Direction: sense

GTTCTAGATCGCGAGCGGCCGCGTGGAGATCATCGCCAACGATCAGGGGCAACCGCATCACGCCGTCCTATGTCG 
10 CCTTCACTCCTGAAGGGGAACGTCTGATTGGCGATGCCGCCAAGAACCAGCTCACCTCCAACCCCGAGAACACGG 
TCTTTGACGCCAAGCGGCTCATCGG 

SEQ ID NO:127

HSP5> 

1 5 ATGGAGGAGGACAAGAAGGAGGACGTGGGCACGGTGGTCGGCATCGACCT 

GGGGACCACCTACTCCTGCGTCGGCGTGTTCAAGAACGGCCGCGTGGAGA 

TCATCGCCAACGATCAGGGCAACCGCATCACGCCGTCCTATGTCGCCTTC 

ACTCCTGAAGGGGAACGTCTGATTGGCGATGCCGCCAAGAACCAGCTCAC 

CTCCAACCCCGAGAACACGGTCTTTGACGCCAAGCGGCTCATCGGCCGCA 
20 CGTGGAATGACCCGTCTGTGCAGCAGGACATCAAGTTCTTGCCGTTCAAG 

GTGGTTGAAAAGAAAACTAAACCATACATTCAAGTTGATATTGGAGGTGG 

GCAAACAAAGACATTTGCTCCTGAAGAAATTTCTGCCATGGTTCTCACTA 

AAATGAAAGAAACCGCTGAGGCTTATTTGGGAAAGAAGGTTACCCATGCA 

GTTGTTACTGTACCAGCCTATTTTAATGATGCCCAACGCCAAGCAACCAA 
25 AGACGCTGGAACTATTGCTGGCCTAAATGTTATGAGGATCATCAACGAGC 

CTACGGCAGCTGCTATTGCTTATGGCCTGGATAAGAGGGAGGGGGAGAAG 

AACATCCTGGTGTTTGACCTGGGTGGCGGAACCTTCGATGTGTCTCTTCT 

CACCATTGACAATGGTGTCTTCGAAGTTGTGGCCACTAATGGAGATACTC 

ATCTGGGTGGAGAAGACTTTGACCAGCGTGTCATGGAACACTTCATCAAA 
30 CTGTACAAAAAGAAGACGGGCAAAGATGTCAGGAAAGACAATAGAGCTGT 

GCAGAAACTCCGGCGCGAGGTAGAAAAGGCCAAACGGGCCCTGTCTTCTC 

AGCATCAAGCAAGAATTGAAATTGAGTCCTTCTATGAAGGAGAAGACTTT 

TCTGAGACCCTGACTCGGGCCAAATTTGAAGAGCTCAACATGGATCTGTT 

CCGGTCTACTATGAAGCCCGTCCAGAAAGTGTTGGAAGATTCTGATTTGA 
3 5 AGAAGTCTGATATTGATGAAATTGTTCTTGTTGGTGGCTCGACTCGAATT 

CCAAAGATTCAGCAACTGGTTAAAGAGTTCTTCAATGGCAAGGAACCATC 

CCGTGGCATAAACCCAGATGAAGCTGTAGCGTATGGTGCTGCTGTCCAGG 

CTGGTGTGCTCTCTGGTGATCAAGATACAGGTGACCTGGTACTGCTTGAT 

GTATGTCCCCTTACACTTGGTATTGAAACTGTGGGAGGTGTCATGACCAA 
40 ACTGATTCCAAGGAACACAGTGGTGCCTACCAAGAAGTCTCAGATCTTTT 

CTACAGCTTCTGATAATCAACCAACTGTTACAATCAAGGTCTATGAAGGT 

GAAAGACCCCTGACAAAAGACAATCATCTTCTGGGTACATTTGATCTGAC 

TGGAATTCCTCCTGCTCCTCGTGGGGTCCCACAGATTGAAGTCACCTTTG 

AGATAGATGTGAATGGTATTCTTCGAGTGACAGCTGAAGACAAGGGTACA 
45 GGGAACAAAAATAAGATCACAATCACCAATGACCAGAATCGCCTGACACC 

TGAAGAAATCGAAAGGATGGTTAATGATGCTGAGAAGTTTGCTGAGGAAG 

ACAAAAAGCTCAAGGAGCGCATTGATACTAGAAATGAGTTGGAAAGCTAT 

GCCTATTCTCTAAAGAATCAGATTGGAGATAAAGAAAAGCTGGGAGGTAA 

ACTTTCCTCTGAAGATAAGGAGACCATGGAAAAAGCTGTAGAAGAAAAGA 
5 0 TTGAATGGCTGGAAAGCCACCAAGATGCTGACATTGAAGACTTCAAAGCT 

AAGAAGAAGGAACTGGAAGAAATTGTTCAACCAATTATCAGCAAACTCTA 

TGGAAGTGCAGGCCCTCCCCCAACTGGTGAAGAGGATACAGCAGAACTCC 

ACCACCACCACCACCAC 



(glucose-regulated protein, 78kD) (HSPA5), mRNA

ACAGCACAGACAGATTGACCTATTGGGGTGTTTCGCGAGTGTGAGAGGGAAGCGCCGCGGCCTGTATTTC 
TAGACCTGCCCTTCGCCTGGTTCGTGGCGCCTTGTGACCCCGGGCCCCTGCCGCCTGCAAGTCGGAAATT 
60 GCGCTGTGCTCCTGTGCTACGGCCTGTGGCTGGACTGCCTGCTGCTGCCCAACTGGCTGGCAAGATGAAG 
CTCTCCCTGGTGGCCGCGATGCTGCTGCTGCTCAGCGCGGCGCGGGCCGAGGAGGAGGACAAGAAGGAGG
ACGTGGGCACGGTGGTCGGCATCGACCTGGGGACCACCTACTCCTGCGTCGGCGTGTTCAAGAACGGCCG 
CGTGGAGATCATCGCCAACGATCAGGGCAACCGCATCACGCCGTCCTATGTCGCCTTCACTCCTGAAGGG 
GAACGTCTGATTGGCGATGCCGCCAAGAACCAGCTCACCTCCAACCCCGAGAACACGGTCTTTGACGCCA 
5 AGCGGCTCATCGGCCGCACGTGGAATGACCCGTCTGTGCAGCAGGACATCAAGTTCTTGCCGTTCAAGGT 
GGTTGAAAAGAAAACTAAACCATACATTCAAGTTGATATTGGAGGTGGGCAAACAAAGACATTTGCTCCT 
GAAGAAATTTCTGCCATGGTTCTCACTAAAATGAAAGAAACCGCTGAGGCTTATTTGGGAAAGAAGGTTA 
CCCATGCAGTTGTTACTGTACCAGCCTATTTTAATGATGCCCAACGCCAAGCAACCAAAGACGCTGGAAC 
TATTGCTGGCCTAAATGTTATGAGGATCATCAACGAGCCTACGGCAGCTGCTATTGCTTATGGCCTGGAT 
10 AAGAGGGAGGGGGAGAAGAACATCCTGGTGTTTGACCTGGGTGGCGGAACCTTCGATGTGTCTCTTCTCA 
CCATTGACAATGGTGTCTTCGAAGTTGTGGCCACTAATGGAGATACTCATCTGGGTGGAGAAGACTTTGA 
CCAGCGTGTCATGGAACACTTCATCAAACTGTACAAAAAGAAGACGGGCAAAGATGTCAGGAAAGACAAT 
AGAGCTGTGCAGAAACTCCGGCGCGAGGTAGAAAAGGCCAAACGGGCCCTGTCTTCTCAGCATCAAGCAA
GAATTGAAATTGAGTCCTTCTATGAAGGAGAAGACTTTTCTGAGACCCTGACTCGGGCCAAATTTGAAGA 
15 GCTCAACATGGATCTGTTCCGGTCTACTATGAAGCCCGTCCAGAAAGTGTTGGAAGATTCTGATTTGAAG 
AAGTCTGATATTGATGAAATTGTTCTTGTTGGTGGCTCGACTCGAATTCCAAAGATTCAGCAACTGGTTA 
AAGAGTTCTTCAATGGCAAGGAACCATCCCGTGGCATAAACCCAGATGAAGCTGTAGCGTATGGTGCTGC 
TGTCCAGGCTGGTGTGCTCTCTGGTGATCAAGATACAGGTGACCTGGTACTGCTTGATGTATGTCCCCTT 
ACACTTGGTATTGAAACTGTGGGAGGTGTCATGACCAAACTGATTCCAAGGAACACAGTGGTGCCTACCA 
20 AGAAGTCTCAGATCTTTTCTACAGCTTCTGATAATCAACCAACTGTTACAATCAAGGTCTATGAAGGTGA 
AAGACCCCTGACAAAAGACAATCATCTTCTGGGTACATTTGATCTGACTGGAATTCCTCCTGCTCCTCGT 
GGGGTCCCACAGATTGAAGTCACCTTTGAGATAGATGTGAATGGTATTCTTCGAGTGACAGCTGAAGACA 
AGGGTACAGGGAACAAAAATAAGATCACAATCACCAATGACCAGAATCGCCTGACACCTGAAGAAATCGA 
AAGGATGGTTAATGATGCTGAGAAGTTTGCTGAGGAAGACAAAAAGCTCAAGGAGCGCATTGATACTAGA 
25 AATGAGTTGGAAAGCTATGCCTATTCTCTAAAGAATCAGATTGGAGATAAAGAAAAGCTGGGAGGTAAAC 
TTTCCTCTGAAGATAAGGAGACCATGGAAAAAGCTGTAGAAGAAAAGATTGAATGGCTGGAAAGCCACCA 
AGATGCTGACATTGAAGACTTCAAAGCTAAGAAGAAGGAACTGGAAGAAATTGTTCAACCAATTATCAGC 
AAACTCTATGGAAGTGCAGGCCCTCCCCCAACTGGTGAAGAGGATACAGCAGAAAAAGATGAGTTGTAGA 
CACTGATCTGCTAGTGCTGTAATATTGTAAATACTGGACTCAGGAACTTTTGTTAGGAAAAAATTGAAAG 
30 AACTTAAGTCTCGAATGTAATTGGAATCTTCACCTCAGAGTGGAGTTGAAACTGCTATAGCCTAAGCGGC 
TGTTTACTGCTTTTCATTAGCAGTTGCTCACATGTCTTTGGGTGGGGGGGAGAAGAAGAATTGGCCATCT 
TAAAAAGCAGGTAAAAAACCTGGGTTAGGGTGTGTGTTCACCTTCAAAATGTTCTATTTAACAACTGGGT 
CATGTGCATCTGGTGTAGGAAGTTTTTTCTACCATAAGTGACACCAATAAATGTTTGTTATTTACACTGG 
TCTAATGTTTGTGAGAAGCTTCTAATTAGATCAATTACTTATTTTAGGAAATTTAAGACTAGATACTCGT 
35 GTGTGGGGTGAGGGGAGGGAGTATTTGGTATGTTGGGATAAGGAAACACTTCTATTTAATGCTTCCAGGG 

GACCACAGGCTTGTTACCACCATGCTCGGCTTTTGCATTAATCTAAGAAAAGGGGAGAGAAGTTAATCCA 
CATCTTTACTCAGGCAAGGGGCATTTCACAGTGCCCAAGAGTGGGGTTTTCTTGAACATACTTGGTTTCC 
TATTTCCCCTTATCTTTCTAAAACTGCCTTTCTGGTGGCTTTTTTTAAAATTATTACTAATGATGCTTTT 

40 ATAGCTGCTTGGATTCTCTGAGAAATGATGGGGAGTGAGTGATCACTGGTATTAACTTTATACACTTGGA 
TTTCATTTGTAACTTTAGGATGTAAAGGTATATTGTGAACCCTAGCTGTGTCAGAATCTCCATCCCTGAA 
ATTTCTCATTAGTGGTACTGGGGTGGGATCTTGGATGGTGACATTGAAACTACACTAAATCCCCTCACTA 
TGAATGGGTTGTTAAAGGCAATGGTTTGTGTCAAAACTGGTTTAGGATTACTTAGATTGTGTTCCTGAAG 
AAAAGAGTCCAGGTAAATGGTATGATCAATAAAGGACAGGCTGGTGCTAACATAAAATCCAATATTGTAA 

45 TCCTAGCACTTTGGGAGGCCAAGGCGGGTGGATCACAAGGTCAAGAGATAGAGACCATCTTTGCCAACAT 
GGTGAAACTCCATCTCTACTGAAAATACAAT^AATTAGCTGGGCGTGGTAGTGCAAGCTGAAGGCTGAGGC 
AGGAGAATCACTCGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGATCACACCACTGTACTCCAGCCCG 
GCACTCCAGCCTGGCGACAAGAGTGAGACTCCACCTCAAAAAAAAAAAAAAGAATCCAATACTGCCCAAG 
GATAGGTATTTTATAGATGGGCAACTGGCTGAAAGGTTAATTCTCTAGGGCTAGTAGAACTGGATCCCAA 

50 CACCAAACTCTTAATTAGACCTAGGCCTCAGCTGCACTGCCCGAAAAGCATTTGGGCAGACCCTGAGCAG 
AATACTGGTCTCAGGCCAAGCCCAATACAGCCATTAAAGATGACCTACAGTGCTGTGTACCCTGGGGCAA 
TAGGGTTAAATGGTAGTTAGCAACTAGGGCTAGTCTTCCCTTACCTCAAAGGCTCTCACTACCGTGGACC 
ACCTAGTCTGTAACTCTTTCTGAGGAGCTGTTACTGAATATTAAAAAGATAGACTTCAAAAAAAAAAAAA 
AAAAA 



SEQ ID NO:129

>gi|16507237|ref|NP_005338.1| heat shock 70kD protein 5 (glucose-regulated
protein, 78kD); BiP; Heat-shock 70kD protein-5 (glucose-regulated protein,
78kD) [Homo sapiens]
60 MKLSLVAAMLLLLSAARAEEEDK3CEDTO 

EGERLIGDAAKNQLTSNPENTVFDAKRLIGR^ 









WO 2004/019893 PCT/US2003/027523 
apeeisamvltkmketae.^BKkkvthavvtvpayfndaqr 

LDKREGEKNILVFDLGGGTFDVSLLTIDNGVFEVVATNGDTHLGGEDFDQRVMEHFIKLYKKKTGKDVRK
DNRAVQKLRREVEKAKRALSSQHQARIEIESFYEGEDFSETLTRAKFEELNMDLFRSTMKPVQKVLEDSD
LKKSDIDEIVLVGGSTRIPKIQQLVKEFFNGKEPSRGINPDEAVAYGAAVQAGVLSGDQDTGDLVLLDVC
PLTLGIETVGGVMTKLIPRNTVVPTKKSQIFSTASDNQPTVTIKVYEGERPLTKDNHLLGTFDLTGIPPA
PRGVPQIEVTFEIDVNGILRVTAEDKGTGNKNKITITNDQNRLTPEEIERMVNDAEKFAEEDKKLKERID
TRNELESYAYSLKNQIGDKEKLGGKLSSEDKETMEKAVEEKIEWLESHQDADIEDFKAKKKELEEIVQPI
ISKLYGSAGPPPTGEEDTAEKDEL



SEQ ID NO:130

IFNgR1

>G3_2_37_PCR_G3F1 Direction: sense

CTGGCTTTAACTCTGACCCAAAGTAGTATTTGATGGATCACCAACATGTATCAGTAAATATTACAATAATGNATG 
TAGTAATATTGATGCAGGCATCAATCCATTCTGAATTCTTAACACCATAGTTCTTTACCTCTACGGTAAAAACAG 
15 GGACCTGTGGCATGATCTGGTACTCCCAATATACGATAGGGTTCATGTTATAGGATTCAATTGTAACATTAGTTG 
GTGTAGGCACTGAGGACGGCCCCAGATCCGCGGTGCCCATCTCAGCCCTGCTCACACCCTGCATGACAAGGGGTA 
GGAGAAAGAGGAGAGCCATGCTGCTACCGACGGTCGCTGGCTCCAACCCCGAGCGCCTGCGGGACCAGCCCAGCG 
CTGCCC 



SEQ ID NO:131

>GL3_2.37_A_M13F Direction: sense

CTGGCTTTAACTCTGACCCAAAGAGAATTTGATGGATCACCAACATGATCAGAAATATTACAATAATGATGAGAA 
ATATTGATGCAGGCATCAATCCATTCTGAATTCTTAACACCATAGTTCTTTACCTCTACGGTAAAAACAGGGACC 
TGTGGCATGATCTGGTACTCCCAATATACGATAGGGTTCATGTTATAGGATTCAATTGTAACATTAGTTGGTGTA 
GGCACTGAGGACGGCCCCAGATCCGCGGTGCCCATCTCAGCCCTGCTCACACCCTGCATGACAAGGGGTAGGAGA
AAGAGGAGAGCCATGCTGCTACCGACGGTCGCTGGCTCCAACCCCGAGCGCCTGCGGGACCAGCCCAGCGCTGCC 
C 



SEQIDNO:132 

>GL3_2.37_A_M13R Direction: anti-sense

GGGCAGCGCTGGGCTGGTCCCGCAGGCGCTCGGGGTTGGAGCCAGCGACCGTCGGTAGCAGCATGGCTCTCCTCT 
TTCTCCTACCCCTTGTCATGCAGGGTGTGAGCAGGGCTGAGATGGGCACCGCGGATCTGGGGCCGTCCTCAGTGC 
CTACACCAACTAATGTTACAATTGAATCCTATAACATGAACCCTATCGTATATTGGGAGTACCAGATCATGCCAC 
AGGTCCCTGTTTTTACCGTAGAGGTAAAGAACTATGGTGTTAAGAATTCAGAATGGATTGATGCCTGCATCAATA 

TTTCTCATCATTATTGTAATATTTCTGATCATGTTGGTGATCCATCAAATTCTCTTTGGGTCAGAGTTAAAGCCA
G 



SEQIDNO:133 

>GL3_2.37_B_M13F Direction: anti-sense

GGGCAGCGCTGGGCTGGTCCCGCAGGCGCTCGGGGTTGGAGCCAGCGACCGTCGGTAGCAGCATGGCTCTCCTCT
TTCTCCTACCCCTTGTCATGCAGGGTGTGAGCAGGGCTGAGATGGGCACCGCGGATCTGGGGCCGTCCTCAGTGC 
CTACACCAACTAATGTTACAATTGAATCCTATAACATGAACCCTATCGTATATTGGGAGTACCAGATCATGCCAC 
AGGTCCCTGTTTTTACCGTAGAGGTAAAGAACTATGGTGTTAAGAATTCAGAATGGATTGATGCCTGCATCAATA 
TTTCTCATCATTATTGTAATATTTCTGATCATGTTGGTGATCCATCAAATTCTCTTTGGGTCAGAGTTAAAGCCA 

G



SEQIDNO:134 

>GL3_2.37_B_M13R Direction: sense

CTGGCTTTAACTCTGACCCAAAGAGAATTTGATGGATCACCAACATGATCAGAAATATTACAATAATGATGAGAA 
ATATTGATGCAGGCATCAATCCATTCTGAATTCTTAACACCATAGTTCTTTACCTCTACGGTAAAAACAGGGACC
TGTGGCATGATCTGGTACTCCCAATATACGATAGGGTTCATGTTATAGGATTCAATTGTAACATTAGTTGGTGTA 
GGCACTGAGGACGGCCCCAGATCCGCGGTGCCCATCTCAGCCCTGCTCACACCCTGCATGACAAGGGGTAGGAGA
AAGAGGAGAGCCATGCTGCTACCGACGGTCGCTGGCTCCAACCCCGAGCGCCTGCGGGACCAGCCCAGCGCTGCC 
C 


SEQIDNO:135 

>GL3_2.37_C_M13F Direction: anti-sense

GGGCAGCGCTGGGCTGGTCCCGCAGGCGCTCGGGGTTGGAGCCAGCGACCGTCGGTAGCAGCATGGCTCTCCTCT 
TTCTCCTACCCCTTGTCATGCAGGGTGTGAGCAGGGCTGAGATGGGCACCGCGGATCTGGGGCCGTCCTCAGTGC 
CTACACCAACTAATGTTACAATTGAATCCTATAACATGAACCCTATCGTATATTGGGAGTACCAGATCATGCCAC
AGGTCCCTGTTTTTACCGTAGI^TAAAGAAC^

TTTCTCATCATTATTGTAATATTTCTGATCATGTTGGTGATCCATCAAATTCTCTTTGGGTCAGAGTTAAAGCCA

G 



SEQIDNO:136

>GL3_2.37_C_M13R Direction: sense

CTGGCTTTAACTCTGACCGAAAGAGAATTTGATGGATCACCAACATGATCAGAAATATTACAATAATGATGAGAA 
ATATTGATGCAGGCATCAATCCATTCTGAATTCTTAACACCATAGTTCTTTACCTCTACGGTAAAAACAGGGACC 
TGTGGCATGATCTGGTACTCCCAATATACGATAGGGTTCATGTTATAGGATTCAATTGTAACATTAGTTGGTGTA 
GGCACTGAGGACGGCCCCAGATCCGCGGTGCCCATCTCAGCCCTGCTCACACCCTGCATGACAAGGGGTAGGAGA
AAGAGGAGAGCCATGCTGCTACCGACGGTCGCTGGCTCCAACCCCGAGCGCCTGCGGGACCAGCCCAGCGCTGCC 
C 

SEQIDNO:137 

>GL3_2.37_D_M13F Direction: sense

CTGGCTTTAACTCTGACCCAAAGAGAATTTGATGGATCACCAACATGATCAGAAATATTACAATAATGATGAGAA 
ATATTGATGCAGGCATCAATCCATTCTGAATTCTTAACACCATAGTTCTTTACCTCTACGGTAAAAACAGGGACC 
TGTGGCATGATCTGGTACTCCCAATATACGATAGGGTTCATGTTATAGGATTCAATTGTAACATTAGTTGGTGTA 
GGCACTGAGGACGGCCCCAGATCCGCGGTGCCCATCTCAGCCCTGCTCACACCCTGCATGACAAGGGGTAGGAGA 

AAGAGGAGAGCCATGCTGCTACCGACGGTCGCTGGCTCCAACCCCGAGCGCCTGCGGGACCAGCCCAGCGCTGCC
C 

SEQIDNO:138 

>GL3_2.37_D_M13R Direction: anti-sense
GGGCAGCGCTGGGCTGGTCCCGCAGGCGCTCGGGGTTGGAGCCAGCGACCGTCGGTAGCAGCATGGCTCTCCTCT
TTCTCCTACCCCTTGTCATGCAGGGTGTGAGCAGGGCTGAGATGGGCACCGCGGATCTGGGGCCGTCCTCAGTGC 
CTACACC^CTAATGTTACAATTGAATCCTATAACATGAACCCTATCGTATATTGGGAGTACCAGATCATGCCAC 

TTTCTCATCATTATTGTAATATTTCTGATCATGTTGGTGATCCATCAAATTCTCTTTGGGTCAGAGTTAAAGCCA 
G

SEQIDNO:139 

>gi|4557879|ref|NM_000416.1| Homo sapiens interferon gamma receptor 1
(IFNGR1), mRNA

CCGCAGGCGCTCGGGGTTGGAGCCAGCGACCGTCGGTAGCAGCATGGCTCTCCTCTTTCTCCTACCCCTT
GTCATGCAGGGTGTGAGCAGGGCTGAGATGGGCACCGCGGATCTGGGGCCGTCCTCAGTGCCTACACCAA 
CTAATGTTACAATTGAATCCTATAACATGAACCCTATCGTATATTGGGAGTACCAGATCATGCCACAGGT 
CCCTGTTTTTACCGTAGAGGTAAAGAACTATGGTGTTAAGAATTCAGAATGGATTGATGCCTGCATCAAT 
ATTTCTCATCATTATTGTAATATTTCTGATCATGTTGGTGATCCATCAAATTCTCTTTGGGTCAGAGTTA 

AAGCCAGGGTTGGACAAAAAGAATCTGCCTATGCAAAGTCAGAAGAATTTGCTGTATGCCGAGATGGAAA
AATTGGACCACCTAAACTGGATATCAGAAAGGAGGAGAAGCAAATCATGATTGACATATTTCACCCTTCA 
GTTTTTGTAAATGGAGACGAGCAGGAAGTCGATTATGATCCCGAAACTACCTGTTACATTAGGGTGTACA 
ATGTGTATGTGAGAATGAACGGAAGTGAGATCCAGTATAAAATACTCACGCAGAAGGAAGATGATTGTGA 
CGAGATTCAGTGCCAGTTAGCGATTCCAGTATCCTCACTGAATTCTCAGTACTGTGTTTCAGCAGAAGGA 

GTCTTACATGTGTGGGGTGTTACAACTGAAAAGTCAAAAGAAGTTTGTATTACCATTTTCAATAGCAGTA
TAAAAGGTTCTCTTTGGATTCCAGTTGTTGCTGCTTTACTACTCTTTCTAGTGCTTAGCCTGGTATTCAT 
CTGTTTTTATATTAAGAAAATTAATCCATTGAAGGAAAAAAGCATAATATTACCCAAGTCCTTGATCTCT 
GTGGTAAGAAGTGCTACTTTAGAGACAAAACCTGAATCAAAATATGTATCACTCATCACGTCATACCAGC 
CATTTTCCTTAGAAAAGGAGGTGGTCTGTGAAGAGCCGTTGTCTCCAGCAACAGTTCCAGGCATGCATAC 

CGAAGACAATCCAGGAAAAGTGGAACATACAGAAGAACTTTCTAGTATAACAGAAGTGGTGACTACTGAA
GAAAATATTCCTGACGTGGTCCCGGGCAGCCATCTGACTCCAATAGAGAGAGAGAGTTCTTCACCTTTAA 
GTAGTAACCAGTCTGAACCTGGCAGCATCGCTTTAAACTCGTATCACTCCAGAAATTGTTCTGAGAGTGA 
TCACTCCAGAAATGGTTTTGATACTGATTCCAGCTGTCTGGAATCACATAGCTCCTTATCTGACTCAGAA 
TTTCCCCCAAATAATAAAGGTGAAATAAAAACAGAAGGACAAGAGCTCATAACCGTAATAAAAGCCCCCA 

CCTCCTTTGGTTATGATAAACCACATGTGCTAGTGGATCTACTTGTGGATGATAGCGGTAAAGAGTCCTT
GATTGGTTATAGACCAACAGAAGATTCCAAAGAATTTTCATGAGATCAGCTAAGTTGCACCAACTTTGAA 
GTCTGATTTTCCTGGAGAGTTTTCTGCTTTAATTTCATGAAAAGATTATGATCTCAGAAATTGTATCTTA 
GTTGGTATG^CCAAATGGAGTGACTTAGTGTAC^TGAAAGCGTJ^ 
TTGGCTTGTAAAGTAGAGACTTTTTTTTTTTTTTAAACAAAAAAAGCATTGTAACCT 

TCCAGATAGGTTACCAGTAACGGAACATATCCAGTACTCCTGGTTCCTAGGTGAGCAGGTGATGCCCCAG
GGACCTTTGTAGCCACTTCT!CTTTTTTTCTTTTCTCTGCCTTGGTATAGCATA , 1WGTTTTGTAAGTTTA
TGCATACAGTAATTTTAAGTAATTTCAGAAGAAATTCTCGAAGCTTTTCAAAATTGGACTTAAAATCTAA
TTCAAACTAATAGAATTAATGGAATATGTAAATAGAAACGTGTATATTTTTTATGAAACATTACAGTTAG
AGATTTTTAAATAAAGAATTTTAAAACTC


SEQIDNO:140 

>gi|4557880|ref|NP_000407.1| interferon gamma receptor 1; Immune
interferon, receptor for [Homo sapiens]
MALLFLLPLVMQGVSRAEMGTADLGPSSVPTPTWTIES 
SEWIDACINISHHYCNISDHVGDPSNSLWVRVKA

IMIDIFHPSVFVNGDEQEVDYDPETTCYIRVYlSr^mmMNGSEIQ 

SQ YCVS AEGVLHWGVTTE KS KE VC I TIFNS SI KG SLW I PVVAALLLFLVLSLVF I CFY I KKI 
1 1 L PKS L I S WRS ATLETKP E S KYVS Ii I TS YQP FS LEKE WCEE PL S PATVPGMHTEDNPGKVEHTEELS 
SITEWTTEENIPDWPGSHLTPIERESSSPLSSNQSEPGSIALNSYHSRNCSESDHSRNGFDTDSSCLE 
SHSSLSDSEFPPNNKGEIKTEGQELITVIKAPTSFGYDKPHVLVDLLVDDSGKESLIGYRPTEDSKEFS

SEQIDNO:141 

Importin α4

>GL2_91_2_M13F Direction: sense

CTATTGTGCTTGCTTCATCACCGGCCATTATCAGAATGTTTTTTAGACCATCTAGAACCACCTGAACCACTTGAG
AATCTTTCACTGACAGTAAATTACAGAACGGTGGTATTACATTCTGCTGTACAAGGTACTCAACCTGATCTTTTC 
TGCCACTTATTGTTAAGTTGCTGATTGCCCAAGCAGCTTCTTTTTGTGTTCCAAAGTCCCCCTTAGCAAGCTGAT 
GAATTATCATAGGAATTAATCCAGCATCTATTACAGCTTGAACTTGTTGCTGGTTGCCTGCTGTTATGTTGGAAA 
GGAACCACACTGCTTCCTTATTTATCTTCTCTTTTGGGTGTGATAAGAGATTTGGGAAGTGTGACAGGACATCAC 

AATTGAGAACAACCTGGGTCTGCTCGTCGGTGCCAGTCACTATGTTGCCAACTGCTCTGAGGGCTGCTGTTTGAA
CTTTGACTTCCTGATGGCTCAGAAGGGGCACAAGAAAGGGCACAACTCCTGAATTCCAATAACCATCTGGTATCT 
GTTCATTACCTTCCATCTGTCAAAGTTATGACAGAGCCCAAAACAAGTGTCTTACAAGAAATGTTTATATCTGGA 
ATGGGTAATATGAAGGACAACATAAAAGGC 

SEQIDNO:142

>GL2_91_1_M13R Direction: sense

CTATTGTGCTTGCTTCATCACCGGCCATTATCAGAATGTTTTTTAGACCATCTAGAACCACCTGAACCACTTGAG 
AATCTTTCACTGACAGTAAATTACAGAAGGGTGGTATTACATTCTGCTGTACAAGGTACTCAACCTGATCTTTTC 
TGCCACTTATTGTTAAGTTGCTGATTGCCCAAGCAGCTTCTTTTTGTGTTCCAAAGTCCCCCTTAGCAAGCTGAT 


GAATTATCATAGGAATTAATCCAGCATCTATTACAGCTTGAACTTGTTGCTGGTTGCCTGCTGTTATGTTGGAAA 

GGAACCACACATGCTTCCTTATTTATCTTCTCTTTAGGGTGTGATAAGAGATTTGGGAAGTGTGACAGGACATCA 

CAATTTGAGGAACAACCTGGGTCTGACTCGTCGGTGCCAGTCACTATGTTGCCAACTGCATCTGAGGGCCTGCTG 

TTTGAAACTTTGACTTCCCTGATGGCTCAGAAGGGGCACAAGAAAGGGCNACAAACTCCTGGAATCNCAATAAAC

AATCTGTATCTGTTCAATAACCCTCCCATCCTGGTGCAAAGGTANCTGAACACGGGAGGCCCCAAAACAGGTGGT 

CCTACAAGGAAATGGTGAACAAACATGGAAATGGGAATAGGANGGACCACCACATAAAGG

SEQIDNO:143

>GL2_91_3_M13R Direction: sense 

CTATTGTGCTTGCTTCATCACCGGCCATTATCAGAATGTTTTTTAGACCATCTAGAACCACCTGAACCACTTGAG 
AATCTTTCACTGACAGTAAATTACAGAACGGTGGTATTACATTCTGCTGTACGAGGTACTCAACCTGATCTTTTC
TGCCACTTATTGTTAAGTTGCTGATTGCCCAAGCAGCTTCTTTTTGTGTTCCAAAGTCCCCCTTAGCAAGCTGAT 
GAATTATCATAGGAATTAATCCAGCATCTATTACAGCTTGAACTTGTTGCTGGTTGCCTGCTGTTATGTTGGAAA 
GGAACCACACTGCTTCCTTATTTATCTTCTCTTTTGGGTGTGATAACGAGATTTGGGAAGTGTGACAGGACATCA 
CAATTGAGAACAACCTGGGTCTGCTCGTCGGTGCCAGTCACTATGTTGCCAACTGCTCTGAGGGCTGCTGTTTTG 
AACTTTGACTTCCTGATGGCTCAGCAAGGGGCACAAGAAAAGGGCACAAACTTCCCTGAAAATCCAATAAACCAA
TCTGTAATTCTGGTTCACATTTACCTTCCAATTCTTGGTTCAAAGGTATGACAGAAGCCCGAAAACAGGTGTTCT 
ACCAAGAAATTGTTCACATAATCCTGAATGGGAATANTGACNGGACACACATTAAAGCCGCAAAAACCANACANA 
CAAAACCAGAC 

SEQIDNO:144

>GL2_202_1_M13R Direction: anti-sense

CTTTATGTGTCCTCATATACCATACAGATATAAACATTCTTGTAGACACTGTTTGGGCTCTGTCATACTTGACAG 
ATGGAGGTAATGAACAGATACAGATGGTTATTGATTCAGAGATTGTGCCCTTTCTTGATGCCCCTTCTTTGGGGA 
AAANCCCCAAACTCCCACGGGGAGAAGCGTCCNAACAAAGNGTTTCCCCACAAAACAAGCACCACAACGCGGCCG 
AAGGACACCGCAGTTCCAACNGGAAAAAGAGGAGAGCCCACAACACACGGGAGCATAATATNTGAGGGGGGGGCC
JUVGAAAACACCCGCAACTTA^

GGAAAAGGGCCCAANGGAAAAACACCCCACCCCANCCCCAANAACACGGGGGGGGAGAATGCATCACGGGGCGAG 
ATCAATACCAGATTTCACCCAAAAAACATTCACAACG 

SEQIDNO:145 

>GL2_202_1_M13F Direction: sense

CTATTGTGCTTGCTTCATCACCGGCCATTATCAGAATGTTTTTTAGACCATCTAGT^ACCACCTGAACCACTTGAG 

AATCTTTCACTGACAGTAAATTACAGAACGGTGGTATTACATTCTGCTGGTTTAACAAGGTACTCAACCTGATCT 

TTTCTGCCACTTATTGTTAAGTTGCTGATTGCCCAAGCAGCTTCTTTTTGTGTTCCAAAGTCCCCCTTAGCAAGG 
CTGATGAATTATCATAGGAATTAATCCAG 

SEQIDNO:146

>GL2_66_F10_G3F1 Direction: N/A

CTATTGTGCTTGCTTCATCACCGGCCATTATCAGAATGTTTTTTAGACCATCTAGAANCACCTGAACCACTTGAG 
AATCTTTCACTGACAGTAAATTACAGAACGGTGGTATTACATTCTGCTGTACAAGGGGACTCAACTGATCTTTTC 
TGGCACTTATTGGTAAGTTGCTGATTGGCCAAGGCAGCTTCTTTTTGTGTTCCAAAGTTCCCCTTAGCAAGCTGA 
TGAATTATCATAGGAATTAATCCAGCATCTATTACAGCTTGGNACCTGTTGCAGGGTTGACCTGCTGGATAATGT 
TTGGGAAGGAAACAGACTGGGTTCCCTTAATTTATCTTATCTCTATAGGGAGTGATAAGAGATATATGGGAAAGT 
TGTTAACAGGACATTACAATTTGATGAACAACCTGGGTCTGATACGTCCNCAG1TTGGGCCCACNCGCAACGCAAA 
TAAGTTTTGGGCCCACAACCAGGGGGCCCACGAAAAAGAGGACGCAGC^ 

AACANTACCCTCNGGAATAGAGCGTCCCAAANAGGGGGCGGACCCAAGGAANAAAGGGGGACCAAAGTACCCTGG 
AAATCCACACATAAAACCATACTGTAATACTCGTGTGCCACGCTGTTAATACCCCANTTCACCAACAATCCACTG 
GGGTCNACACAGGATNAOTTGAAAGCACACGGAAGAGAGCCGC^ 

CAACCACANGGGTGCGCTGAAGTAGCAATAACCAGGANAANTGGGGGGGACGAGCAACTTNGTAAAGGGTGAACG 

SEQIDNO:147 

>GL2_91_1_M13F Direction: anti-sense 

CTTTATGTGTCCTC7VTATACCATACAGATATAAACATTCTTGTAGACACTGTTTGGGCTCTGTCATACTTGACAG 

ATGGAGGTAATGAACAGATACAGATGGTTATTGATTCAGGAGTTGTGCCCTTTCTTGTGCCCCTTCTGAGCCATC 

AGGAAGTCAAAGTTCAAACAGCAGCCCTCAGAGCAGTTGGCAACATAGTGACTGGCACCGACGAGCAGACCCAGG 

TTGTTCTCAATTGTGATGTCCTGTCACACTTCCCAAATCTCTTATCACACCCAAAAGAGAAGATAAATAAGGAAG 

CAGTAGTGGTTCCTTTCCAACATAACAGCAGGCAANCAGCAACAAGTTCAAGCTGTAATAGATGCTGGATTAATT 

CCTATGATAATTTCATCAGCNTTGCTAAGNGGGGACATATTGGAAACACAAAAAAGAAGCTGCATATGGGGCAAC 
AATCAGGCAAANC 

SEQIDNO:148 

>GL2_91_2_M13R Direction: anti-sense

CTTTATGTGTCCTCATATACCATACAGATATAAACATTCTTGTAGACACTGTTTGGGCTCTGTCATACTTGACAG 
ATGGAGGTAATGAACAGATACAGATGGTTATTGATTCAGGAGTTGTGCCCTTTCTTGTGCCCCTTCTGAGCCATC 
AGGAAGTCAAAGTTCAAACAGCAGCCCTCAGAGCAGTTGGCAACATAGTGACTGGCACCGACGAGCAGACCCAGG 
TTGTTCTCAATTGTGATGTCCTGTCACACTTCCCAAATCTCTTATCACACCCAAAAGAGAAGATAAATAAGGAAG 
CAGTGTGGTTCCTTTCCAACATAACAGCAGGCAACCAGCAACAAGTTCAAGCTGTAATAGATGCTGGATTAATTC 
CTATGATAATTCATCAGCTTGCTAAGGGGGACTTTGGAACACAAAAAGAAGCTGCTTGGGCAATCAGCAACTTAA 
CAATAAGTGGCAAGAAAAGATCAGGTTGAGTACCTTGTACAGCAGAATTGTAAATACCACCGTTCTGTAAATTT 

SEQIDNO:149 

>GL2_91_3_M13F Direction: anti-sense

CTTTATGTGTCCTCATATACCATACAGATAT7VAACATTCTTGTAGACACTGTTTGGGCTCTGTCATACTTGACAG 

ATGGAGGTAATGAACAGATACAGATGGTTATTGATTCAGGAGTTGTGCCCTTTCTTGTGCCCCTTCTGAGCCATC 

AGGAAGTCAAAGTTCAAACAGCAGCCCTCAGAGCAGTTGGCAACATAGTGACTGGCACCGACGAGCAGACCCAGG 

TTGTTCTCAATTGTGATGTCCTGTCACACTTCCCAAATCTCTTATCACACCGAAAAGAGAAGATAAATAAGGAAG 

CAGTGTGGTTCCTTTCCAACATAACAGCAGGCAACCAGCAACTyVGTTCAAGCTGTAATAGATGCTGGATTAATTC 

CTATGATAATTCATCAGCTTGCTAAGAGGGGACTTTGGAACACAAAAAGAAGCTGCTTGGGCAATCAGCAAACTT 

AACAAATAAAGTGGCAGAAAAGATCAGGTTGAGTACCTCGTACAGCAGAATGTAATACCACCGATACTGTTAACT 

TNTACCTGTCAGTGAACAGATCCTCAAGTGGATCANAGAGGGACCTAGAATGGGTCCTAAAAAAAAAACAATNTC 

TGGATAAATTNGGGCCGGGAGGACATGAAAAGCAAAGGCAACAAATCAAGGACACGGAACGAGAAACC 

CACATAGGAAGACCACCCGNTGAGCCAACCCGGGACTATGAGGAGGGGCNTGCCCTCCATGGANACAGGAGCAGC 

GACCCAAGNCCAAAAAAAGGGGCGAATACAGTAGAACAAGAGNAGNAACAACACCATACAAGAGAGAANAA 
GACAGCCA 
SEQIDNO:150

>GL2_66_F07_G3F1 Direction: N/A 

CTATTGTGCTTGCTTCATCACCGGCCATTATCAGAATGTTTTTTAGACCATCTAGAACCACCTGAACCACTTGAG 
AATCTTTCACTGACAGTAAATTACAGAACGGTGGTATTACATTCTGCTGTACAAGGTACTCAACCTGATCTTTTC 
TGCCACTTATTGTTAAGTTGCTGATTGCCCAAGCAGCTTCTTTTTGTGTTCCAAAGTCCCCCTTAGCAAGCTGAT
GAATTATCATAGGAATTAATCCAGCATCTATTACAGCTTGAACTTGTTGCTGGTTGCCTGCTGTTATGTTGGAAA 
GGAACCACACTGCTTCCTTATTTATCTTCTCTTTTGGGTGTGATAAGAGATTTGGGAAGTGTGACAGGACATCAC 
AATTGAGAACAACCTGGGTCTGCTCGTCGGTGCCAGTCACTATGTTGCCAACTGCTCTGAGGGCTGCTGTTTGAA 
CTTTGACTTCCTGATGGCTCAGAAGGGGCACAAGAAAGGGCACAACTCCTGAATCAATAACCATCTGTATCTGTT 
CATTACCTCCATCTGTCAAGTATGACAGAGCC



SEQIDNO:151 

>GL2_66_F08_G3F1 Direction: N/A 

CTATTGTGCTTGCTTCATCACCGGCCATTATCAGAATGTTTTTTAGACCATCTAGAACCACCTGAACCACTTGAG 
AATCTTTCACTGACAGTAAATTACAGAACGGTGGTATTACATTCTGCTGTACAAGGTACTCAACCTGATCTTTTC
TACCACTTATTGTTAAGTTGCTGATTGCCCAAGCAGCTTCTTTTTGTGTTCCAAAGTCCCCCTTAGCAAGCTGAT 
GAATTATCATAGGAATTAATCCAGCATCTATTACAGCTTGAACTTGTTGCTGGTTGCCTGCTGTTATGTTGGAAA 
GGAACCACACTGCTTCCTTATTTATCTTCTCTTTTGGGTGTGATAAGAGATTTGGGAAGTGTGACAGGACATCAC 
AATTGAGAACAACCTGGGTCTGCTCGTCGGTGCCAGTCACTATGTTGCCAACTGCTCTGAGGGCTGCTGTTTGAA 
CTTTGACTTCCTGATGGCTCAGAAGGGGCACAAGAAAGGGCACAACTCCTGAATCAATAACCATCTGTATCTGTT
CATTACCTCCATCTGTCAAGTATGACAGAAGCCCAAACAGCTGTCTACAAGAATGTTTATATCTGTATGGTATAT 
TGAAGGACACCATAAAG 



SEQIDNO:152

>GL2_66_F09_G3F1 Direction: N/A

CTATTGTGCTTGGCTTCATCACCGGGCCATTATCAGAATGTTTTTTAGACCATCTAGAACCACCTGAACCACTTG 
AGAATCTTTCACTGACAGTAAATTACAGAACGGTGGTATTACATTCTGCTGTACAAGGTACTCAACCTGATCTTT 
TCTGCCACTTATTGTTAAGTTGCTGATTGCCCAAGCAGCTTCTTTTTGTGTTCCAAAGTCCCCCTTAGCAAGCTG 
ATGAATTATCATAGGAATTAATCCAGCATCTATTACAGCTTGAACTTGGTTGCTGGTTGCCTGCTGTTATGTTGG 

AAAGGAACCACACTGCTTCCTTATTTATCTTCTCTTTTGGGTGTGATAAGAGATTTGGGAAGTGTGACAGGACAT
CACAATTGAGAACAACTGGGTCTGCTCGTCGGTGCCAGTCACTATGTTGCCAACTGCTCTGANGGCTGCTGTTTG 
AACTTTGACTTCCTGATGGCTCAAAGGGGCACAAGAAAGGGCACAACTCCTGAATCAATAACCATCTGTATCTGT 
GTCATTACCTCCATCTGTCAAGTATGACAGAGCCCAAACAGTGTCTACACAGCAATGTTCTATATCTCGTATGGN 
GATATGAGGGACACCTNAAAGCCG 


SEQIDNO:153 

>gi|4504898|ref|NM_002267.1| Homo sapiens karyopherin alpha 3 (importin
alpha 4) (KPNA3), mRNA

AATTCGTGGTTCCAGAATCGATAGGGCTCCAAGATTCGCCGCCGCCGCCGCCGCAGCCGCAGGAGTAGCC 
GCCGCCGGAGCCGCGCGCAGCATGGCCGAGAACCCCAGCTTGGAGAACCACCGCATCAAGAGCTTCAAGA
ACAAGGGCCGCGATGTGGAAACAATGCGAAGACATAGAAATGAAGTGACAGTGGAACTGCGGAAGAACAA 
AAGAGATGAACACTTATTGAAAAAGAGAAATGTTCCCCAAGAAGAAAGTCTAGAAGATTCAGATGTTGAT 
GCTGATTTTAAAGCACAAAATGTAACCCTAGAAGCTATATTGCAGAATGCCACAAGTGATAACCCAGTGG 
TCCAATTGAGTGCTGTCCAGGCAGCAAGAAAACTGTTATCCAGTGACCAGAATCCACCGATTGATGACTT 
AATAAAATCTGGGATTTTACCAATTCTAGTCAAATGTCTAGAAAGGGATGATAATCCTTCATTACAGTTT
GAAGCTGCTTGGGCATTAACTAACATAGCATCAGGAACTTCTGCACAGACTCAAGCTGTTGTGCAGTCTA 
ATGCAGTACCTCTTTTTCTGAGACTTCTTCGTTCACCACATCAGAATGTTTGTGAACAAGCAGTATGGGC 
TTTGGGAAACATTATAGGTGATGGTCCTCAATGTAGAGATTATGTCATATCACTGGGAGTTGTCAAACCT 
CTTCTGTCCTTCATCAGTCCCTCCATCCCCATCACCTTCCTTCGGAACGTCACATGGGTCATTGTCAATC 
TCTGCAGGAATAAGGATCCCCCACCGCCTATGGAGACAGTTCAGGAGATTTTGCCAGCTTTATGTGTCCT
CATATACCATACAGATATAAACATTCTTGTAGACACTGTTTGGGCTCTGTCATACTTGACAGATGGAGGT 

AATGAACAGATACAGATGGTTATTGATTCAGGAGTTGTGCCCTTTCTTGTGCCCCTTCTGAGCCATCAGG

AAGTCAAAGTTCAAACAGCAGCCCTCAGAGCAGTTGGCAACATAGTGACTGGCACCGACGAGCAGACCCA 

GGTTGTTCTCAATTGTGATGTCCTGTCACACTTCCCAAATCTCTTATCACACCCAAAAGAGAAGATAAAT 

AAGGAAGCAGTGTGGTTCCTTTCCAACATAACAGCAGGCAACCAGCAACAAGTTCAAGCTGTAATAGATG
CTGGATTAATTCCTATGATAATTCATCAGCTTGCTAAGGGGGACTTTGGAACACAAAAAGAAGCTGCTTG 
GGCAATCAGCAACTTAACAATAAGTGGCAGAAAAGATCAGGTTGAGTACCTTGTACAGCAGAATGTAATA 
CCACCGTTCTGTAATTTACTGTCAGTGAAAGATTCTCAAGTGGTTCAGGTGGTTCTAGATGGTCTAAAAA 
ACATTCTGATAATGGCCGGTGATGAAGCAAGCACAATAGCTGAAATAATAGAGGAATGTGGAGGTTTGGA 

GAAAATTGAAGTTTTACAGCAACATGAAAATGAAGACATATATAAATTAGCATTTGAAATCATAGATCAG
TATTTCTCTGGTGATGATATTGATGAAGATCCCTGCCTCATTCCTGAAGCAACACAAGGAGGTACCTACA
ATTTTGATCCAACAGCCAACCTTCAAACAAAAGAATTTAATTT 

CCCACATTCAATATGAAGCACCACCAGATGGCTACCAAATGATAAGAACAACAGCAACAAAAGGCTCCAA 
AACACACATGCCTCTTTGTTTTGATGCTTCTAAAGAAAGCCATGTCTCAGTCACTTTGCAGTTGCCAAAA 
GTCACTATCACATGGACTGTAAATGCATATGCATGATTTCCTAAACTGTTTTAGAACTCTCCTTAACAAT
CTCAACTACCCTATTTTTCCCTGTTCCCTGGTGCCACAGGCTGACAACTGCAGTCTCCAGTTTAGAATAA 
ATATTCCATAGTGGTGACATGTCAGCTGCCCACTGATACTCCTTTGGAAAATGGTGCGCTGTGGATCAAG 
ACACTTTGGTATGATGCATATACAAGTTGGAAGACTAAAGAGGTGCAATGTGATCTGAGCCTCCATCATT 
GTCCTCCACAAACATATTTTCATATTCTTTATGTGGAAGAATAGATTTTAAAGTACAAGCCAAATGATTT 
TCATTGGTGGAACTGACACAAAAAAAAGTAACTTAAAAACAAGAAACTTGGTTATTGAATAAACAGATAA
GTTTT 

SEQIDNO:154 

>gi|4504899|ref|NP_002258.1| karyopherin alpha 3; importin alpha 4 [Homo
sapiens]

MAENP S LENHR I KS FKNKGRD VE TMRRHRNE VTVE LRKNKRD EHL L KKRNVP QE E S LED S D VD AD FKAQN 
WLEAILQNATSDNPWQLSAVQAARKLLSSDQNPPIDDLIKSGILPILVK^^ 

NIASGTSAQTQAWQSNAVPLFLRLLRSPHQNVCEQAVWALGNIIGDGPQCRDYVISLGVVKPLLSFISP 
SIPITFLRNVTWIVNLCRNKDPPPPMETVQEXLPALC^ 
IDSGVVPFLVPLLSHQEVKVQTAALRAVGNIVTGTDEQTQV^

SNI TAGNQQQVQAVTDAGL I PM I IHQLAKGDFGTQKEAAWAI SNLT I SGRKDQVE YLVQQNVI P PFCNLL 
SVKDSQWQWLDGLKNILIMAGDEASTIAEI IEECGGLEKIEVLQQHENEDI YKLAFEI IDQYFSGDDI 
DED P CLi I PE ATQGGT YNFDPTANLQTKE FNF 

SEQIDNO:155

Lysosomal pepstatin-insensitive protease (CLN2) 
>GL3_21A_2_M13F Direction: N/A

CCATTTCAATACTATGTGGTCGTTACAATGCTAGAGGTAGTGCACAGGGGGCGCAATGCAAGGGAGGAAGGGCGT 
TAACATCTGCCACCTACTTCCAGGTGCCAAGCACTGTTATCAACATTATTCCACTTTATTCCCCTGGGTGATTAT 

GAAAGGCCAGGTATTGAGTATTCACACTTNACAGACGAGGAAAACAAGCGCTCAGGGNAGAATAAGCGTTACTTG
ANCCAGGTCATACAATACGTTCACCTAGAGTCCCANTTATTCTAGGAAACCACAGGAGTTTCCAAAACAGGGGTG 
TTTGGTGTTNAGGAAACAATCNTATGGGGAGATATTTTTCCAAGCGCAATTACCACAACATTATAAAGTANGCCT 
CCAGGGGGGAAAATCAAAACCATTCCACGAAAAGAAAGGGTNGAACATTCTGGAGAGCATTGAAGGATCTTTGCG 
CAAAGAAATTCCTTGGGAAAAATTTTACCCTGGGCAAGGTATGTTATCAAGAGGGGTTGGGTAAAGGGGGGNACC 

GGGAGAACAAGCGTTTGAAGCGCA

SEQIDNO:156

>GL3_21A_4_M13F Direction: N/A 

CCATTTCAATACTATGTGGTCGTTACAATGCTAGAGGTAGGCACAGGGGGCGCAATGCAAGGGAGGAAGGGCGTT 
AACAATTCCTGGCGCACGCTAACTTTCCAGGTGGCCAAGGCCACTGGTTATCAACCAGTTATTCCACGTTTTATT
TCCCCTGGGTTGGAATTTACTGGAAAAAGGGGGCCAAGGGGGTTAAATTGGAGTATTCACACTTTTATACAGACG 
AGGGAAACAGGCCTCAGGGAGATAAGCTTACTTGACCCAAGTCTCTCGTCCTTAGGTCCATATCCAGGGAAACCC 
AAAGGGAATTTTCCAAAACCAGG

SEQIDNO:157

>GL3_21A_1_M13F Direction: N/A

CCATTTCAATACTATGTGGTCGTTACAATGCTAGAGGTAGGCACAGGGGGCGCAATGCAAGGGAGGAAGGGCGTT 
AACATCTGCCACCTACTTCCAGGTGCCAAGCACTGTTATCAACATTATTCCACTTTATTCCCTGGTGATTATGAA 
AGGCAGGTATTGATATTCACACTTAACAGACGAGGAAACAGCCTCAGGGAGATCAAGCTTACTTGACCCAGTCTC 
TCTCCTAGTTCCATATCAGAACCAAGATTCAAACAGGTTTGGTTTAGAAAATCTAGGATTTTTCAGCCATACCAA
AATAAAGTAGCCTCAGGGAATCAAAACATTCACGAAAGAAGGTGACTTCTGAACTGAGTCTTGCAAGAATTCTGG 
AAATTAACCTGGCAGTTGTATCAGGGGTGGTAAGGGGACGGAA 
SEQIDNO:158
>GL3_21A_2_M13R Direction: anti-sense 

CCACTTGGATGAAAACCCTTCAAGGATTACTGGATAGAATTCAGGCTTTCCCCTCTAGCCCCCAATCATAGCTCA 
CAAACCTTCCTTGCTATTTGTTCTTAAGTAAAAAATGATTTTTCCTCCTCCCTCCCCAAACCCCAAGGAACTCTC 
ACTCTTGCTCAAGCTGTTCCGTCCCCTTACCACCCCTGATACAACTGCCAGGTTAATTTCCAGAATTCTTGCAAG
ACTCAGTTCAGAAGTCACCTTCTTTCGGTGAATGTTTTGATTCCCTGAGGCTACTTTATTTTGGTATGGCTGAAA 
AATCCTAGATTTTCTAAACAAAACCTGTTTGAATCTTGGTTCTGATATGGACTAGGAGAGAGACTG 

SEQIDNO:159 

>GL3_21A_3_M13F Direction: N/A

CCATTTCAATACTATGTGGTCGTTACAATGCTAGAGGTAGGCACAGGGGGCGCAATGCAAGGGAGGAAGGGCGTT 
AACATCTGCCACCTACTTCCAGGTGCCAAGCACTGTTATCAACATTATTCCACTTTATTCCCTGGTGATTATGAA 
AGGCAGGTATTGATATTTCACACTTAACAGACGAGGAAACAGCCTTCAGGGAGATAAGCTTACTTGACCCAGTCT 
CTCTTCCTAGTCCATATCAGAACCAAGATTCAAACAGGTTTGGTTTAGAAAATCTA 


SEQIDNO:160 

>GL3_21A_3_M13R Direction: anti-sense

CCACTTGGATGAAAACCCTTCAAGGATTACTGGATAGAATTCAGGCTTTCCCCTCTAGCCCCCAATCATAGCTCA 
CAAAGCCTTCCTTGCTATTTGTTCTTAAGTAAAAAATCATTTTTCCTCCTCCCTCCCCAAACCCCAAGGAACTCT 
CACTCTTGCTCAAGCTGTTCCGTCCCCTTACCACCCCTGATACAACTGCCAGGTTAATTTCCAGAATTCTTGCAA
GACTCAGTTCAGAAGTCACCTTCTTTCGTGAATGTTTTGATTCCCTGAGGCTACTTTATTTTGGTATGGCTGAAA 
AATCCTAGATTTTCTAAACAAAACCTGTTTGAATCTTGGTTCTGATATGGACTAGGAGAGA 

SEQIDNO:161 

>GL3_21A_4_M13R Direction: anti-sense

CCACTTGGATGAAAACCCTTCAAGGATTACTGGATAGAATTCAGGCTTTCCCCTCTAGCCCCCAAGTCATAGCTC 
ACAAAGCCTTCCTTGCTATTTGTTCTTAAGTAAAAAATCATTTTTCCTCCTCCCTCCCCAAACCCCAAGGGAACT 
CTCACTCTTGGCTCAAGGCTGTTCCGTTCGNCCTTACCACCCCTGATACAACTGCCAGGTTAATTTCCAGAATTC 
TTGCAAGACTCAGTTCAGAAGTCACCTTCTTTCGTGAATGTTTTGATTCCCTGAGGCTACTTTATTTTGGTATGG 

CTGAAAAATCCTAGATTTTCTAAACAAAACCTGTTTGAATCTTGGTTCTGATATGGACTAGGAGAGAGACTGGGT
CAAGTAAGCTTATCTCCCTGAGGCTGTTTCCT 

SEQIDNO:162

>GL3_21A_PCR_G3F1 Direction: N/A

CCATTTCAATACTATGTGGTCGTTACAATGCTAGAGGTAGGCACAGGGGGCGCAATGCAAGGGAGGAAGGGCGTT
AACATCTGCCACCTACTTCCAGGTGCCAAGCACTGTTATCAACATTATTCCCACTTTATTCCCTGGTGATTATGA 
AAGGCAGGTATTGATATTCACACTTAACAGACGAGGAAACAGCCTCAGGGAGATAAGCTTACTTGACCCAGTCTC 
TCTCCTAGTCCATATCAGAACCAAGATTCAAACAGGTTTTGTTTAGAAAATCTAGGATTTTTCAGCCATACCAAA 
ATAAAGTAGCCTCAGGGAATCAAAACATTCACGAAAGAAGGTGACTTCTGAACTGAGTCTTGCAAGAATTCTGGA 

AATTAACCTGGCAGTTGTATCAGGGGTGGTAAGGGGACGGAACAGCTTGAGCAAGAGTGAGAGTTCCTTGGGGTT
TGGGGAGGGAGGAGGAAAAATGATTTTTTACTTAAGAACAAATAGCAAGGAAGGTTTGTGAGCTAT 

SEQIDNO:163 

>gi|5597012|ref|NM_000391.2| Homo sapiens ceroid-lipofuscinosis, neuronal
2, late infantile (Jansky-Bielschowsky disease) (CLN2), mRNA

ACATGACAGCAGATCCGCGGAAGGGCAGAATGGGACTCCAAGCCTGCCTCCTAGGGCTCTTTGCCCTCAT
CCTCTCTGGCAAATGCAGTTACAGCCCGGAGCCCGACCAGCGGAGGACGCTGCCCCCAGGCTGGGTGTCC 
CTGGGCCGTGCGGACCCTGAGGAAGAGCTGAGTCTCACCTTTGCCCTGAGACAGCAGAATGTGGAAAGAC 
TCTCGGAGCTGGTGCAGGCTGTGTCGGATCCCAGCTCTCCTCAATACGGAAAATACCTGACCCTAGAGAA 

TGTGGCTGATCTGGTGAGGCCATCCCCACTGACCCTCCACACGGTGCAAAAATGGCTCTTGGCAGCCGGA
GCCCAGAAGTGCCATTCTGTGATCACACAGGACTTTCTGACTTGCTGGCTGAGCATCCGACAAGCAGAGC 
TGCTGCTCCCTGGGGCTGAGTTTCATCACTATGTGGGAGGACCTACGGAAACCCATGTTGTAAGGTCCCC 
ACATCCCTACCAGCTTCCACAGGCCTTGGCCCCCCATGTGGACTTTGTGGGGGGACTGCACCGTTTTCCC 
CCAACATCATCCCTGAGGCAACGTCCTGAGCCGCAGGTGACAGGGACTGTAGGCCTGCATCTGGGGGTAA 

CCCCCTCTGTGATCCGTAAGCGATACAACTTGACCTCACAAGACGTGGGCTCTGGCACCAGCAATAACAG
CCAAGCCTGTGCCCAGTTCCTGGAGCAGTATTTCCATGACTCAGACCTGGCTCAGTTCATGCGCCTCTTC 
GGTGGCAACTTTGCACATCAGGCATCAGTAGCCCGTGTGGTTGGACAACAGGGCCGGGGCCGGGCCGGGA 
TTGAGGCCAGTCTAGATGTGCAGTACCTGATGAGTGCTGGTGCCAACATCTCCACCTGGGTCTACAGTAG 
CCCTGGCCGGCATGAGGGACAGGAGCCCTTCCTGCAGTGGCTCATGCTGCTCAGTAATGAGTCAGCCCTG 

CCACATGTGCATACTGTGAGCTATGGAGATGATGAGGACTCCCTCAGCAGCGCCTACATCCAGCGGGTCA
ACACTGAGCTCATGAAGGCTGCCGCTCGGGGTCTCACCCTGCTCTTCGCCTCAGGTGACAGTGGGGCCGG

GTGTTGGTCTGTCTCTGGAAGACACCAGTTCCGCCCTACCTTCCCTGCCTCCAGCCCCTATGTCACCACA 

GTGGGAGGCACATCCTTCCAGGAACCTTTCCTCATCACAAATGAAATTGTTGACTATATCAGTGGTGGTG 

GCTTCAGCAATGTGTTCCCACGGCCTTCATACCAGGAGGAAGCTGTAACGAAGTTCCTGAGCTCTAGCCC 

CCACCTGCCACCATCCAGTTACTTCAATGCCAGTGGCCGTGCCTACCCAGATGTGGCTGCACTTTCTGAT 

GGCTACTGGGTGGTCAGCAACAGAGTGCCCATTCCATGGGTGTCCGGAACCTCGGCCTCTACTCCAGTGT 

TTGGGGGGATCCTATCCTTGATCAATGAGCACAGGATCCTTAGTGGCCGCCCCCCTCTTGGCTTTCTCAA 

CCCAAGGCTCTACCAGCAGCATGGGGCAGGACTCTTTGATGTAACCCGTGGCTGCCATGAGTCCTGTCTG 

GATGAAGAGGTAGAGGGCCAGGGTTTCTGCTCTGGTCCTGGCTGGGATCCTGTAACAGGCTGGGGAACAC 

CCAACTTCCCAGCTTTGCTGAAGACTCTACTCAACCCCTGACCCTTTCCTATCAGGAGAGATGGCTTGTC 

CCCTGCCCTGAAGCTGGCAGTTCAGTCCCTTATTCTGCCCTGTTGGAAGCCCTGCTGAACCCTCAACTAT 

TGACTGCTGCAGACAGCTTATCTCCCTAACCCTGAAATGCGGTGAGCTTGACTTGACTCCCAACCCTACC 

ATGCTCCATCATACTCAGGTCTCCCTACTCCTGCCTTAGATTCCTCAATAAGATGCTGTAACTAGCATTT 

TTTGAATGCCTCTCCCTCCGCATCTCATCTTTCTCTTTTCAATCAGGCTTTTCCAAAGGGTTGTATACAG 

ACTCTGTGCACTATTTCACTTGATATTCATTCCCCAATTCACTGCAAGGAGACCTCTACTGTCACCGTTT 

ACTCTTTCCTACCCTGACATCCAGAAACAATGGCCTCCAGTGCATACTTCTCAATCTTTGCTTTATGGCC 

TTTCCATCATAGTTGCCCACTCCCTCTCCTTACTTAGCTTCCAGGTCTTAACTTCTCTGACTACTCTTGT 

CTTCCTCTCTCATCAATTTCTGCTTCTTCATGGAATGCTGACCTTCATTGCTCCATTTGTAGATTTTTGC 
TCTTCTCAGTTTACTCATTGTCCCCTGGAAC^^ 

AAGACTTTCTATCCAATAATGATTGATACCTCAAATGTAAGATGCGTGATACTCAACATTTCATCGTCCA 
CCTTCCCAACCCCAAACAATTCCATCTCGTTTCTTCTTGGTAAATGATGCTATGCTTTTTCCAACCAAGC 
CAGAAACCTGTGTCATCTTTTCACCCCACCTTCAATCAACAAGTCCTCAATCAACAAGTCCTACTGACTG 
CACATCTTAAATATATCTTTATCAGTCCACAAGTCCTTCCAATTATATTTCCCAAGTATATCTAGAACTT 
ATCCACTTATATCCCCACTGCTACTACCTTAGTTTAGGGCTATATTCTCTTGAAAAAAAGTGTCCTTACT 
TCCTGCCAATCCCCAAGTCATCTTCCAGAGTAAAATGCAAATCCCATCAGGCCACTTGGATGAAAACCCT
TCAAGGATTACTGGATAGAATTCAGGCTTTCCCCTCCAGCCCCCAATCATAGCTCACAAACCTTCCTTGC 
TATTTGTTCTTAAGTAAAAAATCATTTTTCCTCCTCCCTCCCCAAACCCCAAGGAACTCTCACTCTTGCT 
CAAGCTGTTCCGTCCCCTTACCACCCCTGATACAACTGCCAGGTTAATTTCCAGAATTCTTGCAAGACTC 
AGTTCAGAAGTCACCTTCTTTCGTGAATGTTTTGATTCCCTGAGGCTACTTTATTTTGGTATGGCTGAAA 
AATCCTAGATTTTCTAAACAAAACCTGTTTGAATCTTGGTTCTGATATGGACTAGGAGAGAGACTGGGTC 
AAGTAAGCTTATCTCCCTGAGGCTGTTTCCTCGTCTGTTAAGTGTGAATATCAATACCTGCCTTTCATAA 
TCACCAGGGAATAAAGTGGAATAATGTTGATAACAGTGCTTGGCACCTGGAAGTAGGTGGCAGATGTTAA 
CGCCCTTCCTCCCTTGCACTGCGCCCCCTGTGCCTACCTCTAGCATTGTAACGACCACATAGTATTGAAA 
TGGCCAGTTTACTTGTCTGCCTTCCTTTCCAAGACCGTTGGTGCCTAGAGGACTAGAATCGTGTCCTATT 
TAACTTTGTGTTCCCAGGTCCTAGCTCAGGAGTTGGCAAATAAGAATTAAATGTCTGCTACACCGAAACA 



SEQIDNO:164

>gi|5729770|ref|NP_000382.3| ceroid-lipofuscinosis, neuronal 2, late
infantile (Jansky-Bielschowsky disease) [Homo sapiens]

MGLQACLLGLFALIDSGKCSYSPEPDQRRTLPPGWSLGRADPEEELSLTFALRQQNVERLSELVQAVSD 
PS S PQYGKYLTXjENVADLVRPS PIjTXjHTVQKWLLAAGAQKCHS VITQDFLTCWLS IRQAELLLPGAEFHH 
WGGPTETHVVRSPHPYQLPQALAPHVDFVGGLHRFPPTSSLRQRPEPQ 

LTS QDVGSGTSNNS QACAQFLEQ YFHDSDLAQFMRL FGGNFAHQAS VARVVGQQGRGRAGI EASLDVQYL 
MSAGANISTWVTSSPGRHEGQEPFLQWIJYIL^^ 

GLTLLFASGDSGAGCWSVSGRHQFRPTFPASSPYVTTVGGTSFQEPFLITNEIVDYISGGGFSWFPRPS 

YQEEAVTKFLSSSPHLPPSSYFNASGRAYPDVAALSDGYWWSNRVPIPWVSGTSASTPVFGGILSLINE 

HRILSGRPPLGFLNPRLYQQHGAGLFDVTRGCHESCLDEEVEGQGFCSGPGWDPVTGWGTPNFPALLKTL 
LNP 

SEQIDNO:165 

Novel protein disulfide isomerase
Novel (disulfide disomerase-like) 

CGGACCAACACAGTATTGAGTCAACTGTGACCTTAAGATCAGAGGAACGTCAATACTGCCACAAGGCCACCTTTC 
CAGAACTCGTGGGCAGGTAAACTATGCTTTGGATGTGCTTTCTTTCACCAAAATCACTCAACTCAGGAGCCAC^ 
ATAGTCCAGCAATTTCATTTCCCTCAACGCTATTTTAGTCTCAAAGGAAACCATGTAAATTTCATCAAGAGAAG^ 
TCAAAGGGGATATATCGCCACTGAAAATGTTTACACAGTGACCATGAGTTACACATTTACTTAGAGAAACTTAAC 
TTAATAAAGAATCTGTAGAGTGTGTTGGCTTGGAAAACACACACACAAAGAAGATACCTCACGCTTAGTATGTTC 
TGCTTTCTGAACAGCCACCACTGGGAACCCAGTGGCCTCTGTGGGACTGAACTCCTAAACGCAGGGTGCGGGAGC 
TGGGCAGGAGAGGTGACCTCCAACTGTGTTCCTAAAGTTCGTCTTTCGCTTGGCTCAGGACAAAGCGGTGTAACG 
AGTCAAAGTCTCTGCCTCCACTGTGCTCACTGACTTTCTTCCCTCCTCGGAAAAGCAATAACGTGGGGTAGCCTC
GT 

SEQIDNO:166 
Microtubule-associated protein 4 (MAP4)
>GL1_19_HIGH_3_G3F1 Direction: sense

CCACCTCCATTTCTGGAGGCAGACATACGTTCTTGATGAGAACTACTTCTGTTTCTGGAGGTGGAGTCACATCCT 
TGCCCAGAGCCATTTCTGTTTCTGATAGTG 

SEQIDNO:167

>GL1_19_HIGH_3_G3R1 Direction: anti-sense

CACTATCAGAAACAGAAATGGCTCTGGGCAAGGATGTGACTCCACCTCCAGAAACAGAAGTAGTTCTCATCAAGA 
ACGTATGTCTGCCTCCAGAAATGGAGGTGG 

SEQIDNO:168

>GL1_19_LOW_2_G3F1 Direction: sense

CCACCTCCATTTCTGGAGGCAGACATACGTTCTTGATGAGAACTACTTCTGTTTCTGGAGGTGGAGTCANATCCT 
TGCCCAGAGCCATTTCTGTTTCTGATAGTG 



SEQIDNO:169

>gi|14195631|ref|NM_002375.2| Homo sapiens microtubule-associated protein 4
(MAP4), transcript variant 1, mRNA

TGCGACCGCCTCCCTGCGCCCCGCCCCTCCGGCTAGCTCGCTGGCTCCCGGCTCCTCCCGACGTCTCCTA 
CCTCCTCACGGCTCTTCCCGGCGCTCTCCTGGCTCCCTTCTGCCCCAGCTCCGTCTCGGCGGCGGCGGGC 

AGTTGCAGTGGTGCAGAATGGCTGACCTCAGTCTTGCAGATGCATTAACAGAACCATCTCCAGACATTGA
GGGAGAGATAAAGCGGGACTTCATTGCCACACTAGAGGCAGAGGCCTTTGATGATGTTGTGGGAGAAACT 
GTTGGAAAAACAGACTATATTCCTCTCCTGGATGTTGATGAGAAAACCGGGAACTCAGAGTCAAAGAAGA 
AACCGTGCTCAGAAACTAGCCAGATTGAAGATACTCCATCTTCTAAACCAACACTCCTAGCCAATGGTGG 
TCATGGAGTAGAAGGGAGCGATACTACAGGGTCTCCAACTGAATTCCTTGAAGAGAAAATGGCCTACCAG 

GAATACCCAAATAGCCAGAACTGGCCAGAAGATACCAACTTTTGTTTCCAACCTGAGCAAGTGGTCGATC
CTATCCAGACTGATCCCTTTAAGATGTACCATGATGATGACCTGGCAGATTTGGTCTTTCCCTCCAGTGC 
GACAGCTGATACTTCAATATTTGCAGGACAAAATGATCCCTTGAAAGACAGTTACGGTATGTCTCCCTGC 
AACACAGCTGTTGTACCTCAGGGGTGGTCTGTGGAAGCCTTAAACTCTCCACACTCAGAGTCCTTTGTTT 
CCCCAGAGGCTGTTGCAGAACCTCCTCAGCCAACGGCAGTTCCCTTAGAGCTAGCCAAGGAGATAGAAAT 

GGCATCAGAAGAGAGGCCACCAGCACAAGCATTGGAAATAATGATGGGACTGAAGACTACTGACATGGCA
CCATCTAAAGAAACAGAGATGGCCCTCGCCAAGGACATGGCACTAGCTACAAAAACCGAGGTGGCATTGG 
CTAAAGATATGGAATCACCCACCAAATTAGATGTGACACTGGCCAAGGACATGCAGCCATCCATGGAATC 
AGATATGGCCCTAGTCAAGGACATGGAACTACCCACAGAAAAAGAAGTGGCCCTGGTTAAGGATGTCAGA 
TGGCCCACAGAAACAGATGTATCTTCAGCCAAGAATGTGGTACTGCCCACAGAAACAGAGGTAGCCCCAG 

CCAAGGATGTGACACTGTTGAAAGAAACAGAGAGGGCATCTCCTATAAAAATGGACTTAGCCCCTTCCAA
GGACATGGGACCACCCAAAGAAAACAAGAAAGAAACAGAGAGGGCATCTCCTATAAAAATGGACTTGGCT 
CCTTCCAAGGACATGGGACCACCCAAAGAAAACAAGATAGTCCCAGCCAAGGATTTGGTATTACTCTCAG 
AAATAGAGGTGGCACAGGCTAATGACATTATATCATCCACAGAAATATCCTCTGCTGAGAAGGTGGCTTT 
GTCCTCAGAAACAGAGGTAGCCCTGGCCAGGGACATGACACTGCCCCCGGAAACCAACGTGATCTTGACC

AAGGATAAAGCACTACCTTTAGAAGCAGAGGTGGCCCCAGTCAAGGACATGGCTCAACTCCCAGAAACAG
AAATAGCCCCGGCCAAGGATGTGGCTCCGTCCACAGTAAAAGAAGTGGGCTTGTTGAAGGACATGTCTCC 
ACTATCAGAAACAGAAATGGCTCTGGGCAAGGATGTGACTCCACCTCCAGAAACAGAAGTAGTTCTCATC 
AAGAACGTATGTCTGCCTCCAGAAATGGAGGTGGCCCTGACTGAGGATCAGGTCCCAGCCCTCAAAACAG 
AAGCACCCCTGGCTAAGGATGGGGTTCTGACCCTGGCCAACAATGTGACTCCAGCCAAAGATGTTCCACC 

ACTCTCAGAAACAGAGGCAACACCAGTTCCAATTAAAGACATGGAAATTGCACAAACACAAAAAGGAATA
AGTGAGGATTCCCATTTAGAATCTCTGCAGGATGTGGGGCAGTCAGCTGCACCTACTTTCATGATTTCAC 
CAGAAACCATCACAGGAACGGGGAAAAAGTGCAGCTTGCCGGCCGAGGAGGATTCTGTGTTAGAAAAACT 
AGGGGAAAGGAAACCATGCAACAGTCAACCTTCTGAGCTTTCTTCAGAGACCTCAGGAATAGCCAGGCCA 
GAAGAAGGAAGGCCTGTGGTGAGTGGGACAGGAAATGACATCACCACCCCACCGAACAAGGAGCTCCCAC 

CAAGCCCAGAGAAGAAAACAAAGCCTTTGGCCACCACTCAACCTGCAAAGACTTCAACATCGAAAGCCAA
AACACAGCCCACTTCTCTCCCTAAGCAGCCAGCTCCCACCACCATTGGTGGGTTGAATAAAAAACCCATG 
AGCCTTGCTTCAGGCTTAGTGCCAGCTGCCCCACCCAAACGCCCTGCCGTCGCCTCTGCCAGGCCTTCCA 
TCTTACCTTCAAAAGACGTGAAGCCAAAGCCCATTGCAGATGCAAAGGCTCCTGAGAAGCGGGCCTCACC 
ATCCAAGCCAGCTTCTGCCCCAGCCTCCAGATCTGGGTCCAAGAGCACTCAGACTGTTGCAAAAACCACA 

ACAGCTGCTGCTGTTGCCTCAACTGGCCCAAGCAGTAGGAGCCCCTCCACGCTCCTGCCCAAGAAGCCCA
CTGCCATTAAGACTGAGGGAAAACCTGCAGAAGTCAAGAAGATGACTGCAAAGTCTGTACCAGCTGACTT
GAGTCGCCCAAAGAGCACCTCCACCAGTTCCATGAAGAAAACCACCACTCTCAGTGGGACAGCCCCCGCT 
GCAGGGGTGGTTCCCAGCCGAGTC^GGCCACACCCATGCCCTCCCGGCCCTCCACAACTCCTTTCATAG 
ACAAGAAGCCCACCTCGGCCAAACCCAGCTCCACCACCCCCCGGCTCAGCCGCCTGGCCACCAATACTTC 
TGCTCCTGATCTGAAGAATGTCCGCTCCAAGGTTGGCTCCACGGAAAACATCAAGCATCAGCCTGGAGGA
GGCCGGGCCAAAGTAGAGAAAAAAACAGAGGCAGCTGCTACAACCCGAAAGCCTGAATCTAATGCAGTCA 
CTAAAACAGCCGGCCCAATTGCAAGTGCACAGAAACAACCTGCGGGGAAAGTCCAGATAGTCTCCAAAAA 
AGTGAGCTACAGCCATATTCAGTCCAAGTGTGGTTCCAAGGACAATATTAAGCATGTCCCTGGAGGTGGT 
AATGTTCAGATTCAGAACAAGAAAGTGGACATCTCTAAGGTCTCCTCCAAGTGTGGGTCTAAGGCTAACA 

TCAAGCACAAGCCTGGTGGAGGAGATGTCAAGATTGAAAGTCAGAAGTTGAACTTCAAGGAGAAGGCCCA
GGCCAAGGTGGGATCCCTCGATAATGTGGGCCACCTACCTGCAGGAGGTGCTGTGAAGACTGAGGGCGGT 
GGCAGCGAGGCTCCTCTGTGTCCGGGTCCCCCTGCTGGGGAGGAGCCGGCCATCTCTGAGGCAGCGCCTG 
AAGCTGGCGCCCCCACTTCAGCCAGTGGCCTCAATGGCCACCCCACCCTGTCAGGGGGTGGTGACCAAAG 
GGAGGCCCAGACC77TGGACAGCCAGATCCAGGAGACAAGCATCTAATGATGACATTCTGGTCTCGTCTTC 

CGTCTCCCCCGTGTTCCCCTCTTGTCTCCCCTGTTCCCCTCTCCCTTCCCTCCTCCCATGTCACTGCAGA
TTGAGACCTACAGGCTGACGTTCCGGGCAAATGCCAGGGCCCGCACCGACCACGGGGCCGACATTGTCTC 
CCGCCCCCCACACTTCCCTGGCGGCCCCAACTCGGGCTCCCGGGTCCTTGGCCCCCTTTCCCGGGCTGTC 
CACTAGACCAGTGAGCGCTTGGGCGCCGTGCTGGGCAGCCCGCTAGGCTCGCCTTCCCTCCTGCTTTGCG 
TGCCCGGGGCAGCAGCAGCCCTGCCCCACACCTCCTCTCACTCCCCAGCCTGGGCCCATCTCCCTGCTTT 

GGTCTTGCCCCATCACTGCGCCACTGCTCCGTGGAGGAGGTTGGGAGGGGGTTGGGGTGGTTGAGGCTAA
GTTGGGATCTAGGAGAGGAGAACCAGATTCTATCCTCATCTTTTTTTGGTTCTTTGGTCCAAACCCAAAA 
GAAACTGACATGCCCTCCCTTCTCCCTGGATCTACCTGGAGGGAAGAGTGGAGGTGGATTCCGAGTGGTG 
ACAGGACGCTGACCGTGGAGCTTAAGCCACTGCCTCTCCCTCTGGTCCCACAAATGGGCGCCCCCCCCTC 
CCCATGCAGGTGGTGTCGGGCCCTTCTTGCTGCCCTGCCCCAAGTTGGGGGTCAGTGCTGCCTGTCCCCA 

TGCTTAACATACCCGCCTAGCTGCTGTCACATTTTTCTTGTTTTGTCCTTTTATTTTTTTCTAATAACCT
AAAAACTGGCAAAATAGTTCTGCAGGTTGAAGCCATGTCTAGATGAAAGTCCTCAGTAAGTGTTAGAGGG 
AACAGGGCGGAGATATCCTTATGCCACCCCCGCTGGAGGATGTGGGCAGCTTAGGGCCCTGGAGGCGGTG 
CGGCAGGGAAGAGGGGTGCAGAGGCTGTGGCTGGTGAGCCGGTCAGGCACACAAGGGGCCCTTGGAGCGT 
GGACTGGTTGGTTTTGCCATTTTGTTGTGTGTATGCTGCTTTTCTTTTCTAACCAAGAGGCTGGTTTTGG 

CATCTCTGTCCCATTCCCTGGGATCTGGTGGTCAGCCCTAGGATAAAAAGCCAGGGCTGGAGAACAAGAA
AGGGCCAGGAGATGAATTC 

SEQ ID NO:170

>gi|4505099|ref|NP_002366.1| microtubule-associated protein 4, isoform 1 [Homo sapiens]

MADLSLADALTEPSPDIEGEIKRDFIATLE^ 
SQIEDTPSSKPTLLANGGHGVEGSDTTGSPT^^ 

FKMYHDDDLADLVFPSSATADTSIFAGQNDPLKDSYGMSPCNTAVVPQGWSVEALNSPHSESFVSPEAVA
EPPQPTAVPLELAKEIEMASEERPPAQALEIMMGLKTT^ 
PTKLDVTLAKDMQPSMESDMALVKDMELPTEKE^

LKETERASPIKMDLAPSKDMGPPKENKKETERASPIKMDLAPSKDMGPPKEN
ANDIISSTEISSAEKVALSSETEVALARDMTLPPETNVILTKDKAL
DVAPSTVKEVGLLKDMSPLSETEMALGKDVTPPPETEVV^ 
DGVLTLANNVTPAKDVPPLSETEATPVPIKD^

TGKKCSLPAEEDSVLEKLGERKPCNSQPSELSSETSGIARPEEGRPVVSGTGNDITTPPNKELPPSPEKK
TKPLATTQPAKTSTSKAKTQPTSLPKQPA^ 

VKPKPIADAKAPEKRASPSKPASAPASRSGSKSTQTVAKTTTAAAVASTGPSSRSPSTLLPK^
GKPAEVKKMTAKSVPADLSRPKSTSTSSMKKTTTLSGTAPAAGVVPSRVKATPMPSRPSTTPFIDKKPTS 
AKPSSTTPRLSRLATNTSAPDLKNVRSKVGSTENIKHQPGGGRAKVEKKTEAAATTRKPESNAVTKTAGP
IASAQKQPAGKVQIVSKKVSYSHIQSKCGSKDNIKHVPGGGNVQIQNKKVDISKVSSK

GGDVKIESQKLNFKEKAQAKVGSLDNVGHLPAGGAVKTEGGGSEAPLCPGPPAGEEPAISEAAPEAGAPT
SASGLNGHPTLSGGGDQREAQTLDSQIQETSI

SEQ ID NO:171

>gi|14195625|ref|NM_030884.1| Homo sapiens microtubule-associated protein 4 (MAP4), transcript variant 2, mRNA

TGCGACCGCCTCCCTGCGCCCCGCCCCTCCGGCTAGCTCGCTGGCTCCCGGCTCCTCCCGACGTCTCCTA 






CCTCCTCACGGCTCTTCCCGGCGCTCTCCTGGCTCCCTTCTGCCCCAGCTCCGTCTCGGCGGCGGCGGGC 
AGTTGCAGTGGTGCAGAATGGCTGACCTCAGTCTTGCAGATGCATTAACAGAACCATCTCCAGACATTGA 
GGGAGAGATAAAGCGGGACTTCATTGCCACACTAGAGGCAGAGGCCTTTGATGATGTTGTGGGAGAAACT 
GTTGGAAAAACAGACTATATTCCTCTCCTGGATGTTGATGAGAAAACCGGGAACTCAGAGTCAAAGAAGA 
AACCGTGCTCAGAAACTAGCCAGATTGAAGATACTCCATCTTCTAAACCAACACTCCTAGCCAATGGTGG
TCATGGAGTAGAAGGGAGCGATACTACAGGGTCTCCAACTGAATTCCTTGAAGAGAAAATGGCCTACCAG 
GAATACCCAAATAGCCAGAACTGGCCAGAAGATACCAACTTTTGTTTCCAACCTGAGCAAGTGGTCGATC 
CTATCCAGACTGATCCCTTTAAGATGTACCATGATGATGACCTGGCAGATTTGGTCTTTCCCTCCAGTGC 
GACAGCTGATACTTCAATATTTGCAGGACAAAATGATCCCTTGAAAGACAGTTACGGTATGTCTCCCTGC
AACACAGCTGTTGTACCTCAGGGGTGGTCTGTGGAAGCCTTAAACTCTCCACACTCAGAGTCCTTTGTTT 
CCCCAGAGGCTGTTGCAGAACCTCCTCAGCCAACGGCAGTTCCCTTAGAGCTAGCCAAGGAGATAGAAAT 
GGCATCAGAAGAGAGGCCACCAGCACAAGCATTGGAAATAATGATGGGACTGAAGACTACTGACATGGCA 
CCATCTAAAGAAACAGAGATGGCCCTCGCCAAGGACATGGCACTAGCTACAAAAACCGAGGTGGCATTGG 

CTAAAGATATGGAATCACCCACCAAATTAGATGTGACACTGGCCAAGGACATGCAGCCATCCATGGAATC
AGATATGGCCCTAGTCAAGGACATGGAACTACCCACAGAAAAAGAAGTGGCCCTGGTTAAGGATGTCAGA 
TGGCCCACAGAAACAGATGTATCTTCAGCCAAGAATGTGGTACTGCCCACAGAAACAGAGGTAGCCCCAG 
CCAAGGATGTGACACTGTTGAAAGAAACAGAGAGGGCATCTCCTATAAAAATGGACTTAGCCCCTTCCAA 
GGACATGGGACCACCCAAAGAAAACAAGAAAGAAACAGAGAGGGCATCTCCTATAAAAATGGACTTGGCT 

CCTTCCAAGGACATGGGACCACCCAAAGAAAACAAGATAGTCCCAGCCAAGGATTTGGTATTACTCTCAG
AAATAGAGGTGGCACAGGCTAATGACATTATATCATCCACAGAAATATCCTCTGCTGAGAAGGTGGCTTT 
GTCCTCAGAAACAGAGGTAGCCCTGGCCAGGGACATGACACTGCCCCCGGAAACCAACGTGATCTTGACC 
AAGGATAAAGCACTACCTTTAGAAGCAGAGGTGGCCCCAGTCAAGGACATGGCTCAACTCCCAGAAACAG 
AAATAGCCCCGGCCAAGGATGTGGCTCCGTCCACAGTAAAAGAAGTGGGCTTGTTGAAGGACATGTCTCC 

ACTATCAGAAACAGAAATGGCTCTGGGCAAGGATGTGACTCCACCTCCAGAAACAGAAGTAGTTCTCATC
AAGAACGTATGTCTGCCTCCAGAAATGGAGGTGGCCCTGACTGAGGATCAGGTCCCAGCCCTCAAAACAG 
AAGCACCCCTGGCTAAGGATGGGGTTCTGACCCTGGCCAACAATGTGACTCCAGCCAAAGATGTTCCACC 
ACTCTCAGAAACAGAGGCAACACCAGTTCCAATTAAAGACATGGAAATTGCACAAACACAAAAAGGAATA 
AGTGAGGATTCCCATTTAGAATCTCTGCAGGATGTGGGGCAGTCAGCTGCACCTACTTTCATGATTTCAC 

CAGAAACCATCACAGGAACGGGGAAAAAGTGCAGCTTGCCGGCCGAGGAGGATTCTGTGTTAGAAAAACT
AGGGGAAAGGAAACCATGCAACAGTCAACCTTCTGAGCTTTCTTCAGAGACCTCAGGAATAGCCAGGCCA 
GAAGAAGGAAGGCCTGTGGTGAGTGGGACAGGAAATGACATCACCACCCCACCGAACAAGGAGCTCCCAC 
CAAGCCCAGAGAAGAAAACAAAGCCTTTGGCCACCACTCAACCTGCAAAGACTTCAACATCGAAAGCCAA 
AACACAGCCCACTTCTCTCCCTAAGCAGCCAGCTCCCACCACCATTGGTGGGTTGAATAAAAAACCCATG 

AGCCTTGCTTCAGGCTTAGTGCCAGCTGCCCCACCCAAACGCCCTGCCGTCGCCTCTGCCAGGCCTTCCA
TCTTACCTTCAAAAGACGTGAAGCCAAAGCCCATTGCAGATGCAAAGGCTGCTGAGAAGCGGGCCTCACC 
ATCCAAGCCAGCTTCTGCCCCAGCCTCCAGATCTGGGTCCAAGAGCACTCAGACTGTTGCAAAAACCACA 
ACAGCTGCTGCTGTTGCCTCAACTGGCCCAAGCAGTAGGAGCCCCTCCACGCTCCTGCCCAAGAAGCCCA 
CTGCCATTAAGACTGAGGGAAAACCTGCAGAAGTCAAGAAGATGACTGCAAAGTCTGTACCAGCTGACTT 

GAGTCGCCCAAAGAGCACCTCCACCAGTTCCATGAAGAAAACCACCACTCTCAGTGGGACAGCCCCCGCT
GCAGGGGTGGTTCCCAGCCGAGTCAAGGCCACACCCATGCCCTCCCGGCCCTCCACAACTCCTTTCATAG 
ACAAGAAGCCCACCTCGGCCAAACCCAGCTCCACCACCCCCCGGCTCAGCCGCCTGGCCACCAATACTTC 
TGCTCCTGATCTGAAGAATGTCCGCTCCAAGGTTGGCTCCACGGAAAACATCAAGCATCAGCCTGGAGGA 
GGCCGGGTTCAGATAGTCTCCAAAAAAGTGAGCTACAGCCATATTCAGTCCAAGTGTGGTTCCAAGGACA 

ATATTAAGCATGTCCCTGGAGGTGGTAATGTTCAGATTCAGAACAAGAAAGTGGACATCTCTAAGGTCTC
CTCCAAGTGTGGGTCTAAGGCTAACATCAAGCACAAGCCTGGTGGAGGAGATGTCAAGATTGAAAGTCAG 
AAGTTGAACTTCAAGGAGAAGGCCCAGGCCAAGGTGGGATCCCTCGATAATGTGGGCCACCTACCTGCAG 
GAGGTGCTGTGAAGACTGAGGGCGGTGGCAGCGAGGCTCCTCTGTGTCCGGGTCCCCCTGCTGGGGAGGA 
GCCGGCCATCTCTGAGGCAGCGCCTGAAGCTGGCGCCCCCACTTCAGCCAGTGGCCTCAATGGCCACCCC

ACCCTGTCAGGGGGTGGTGACCAAAGGGAGGCCCAGACCTTGGACAGCCAGATCCAGGAGACAAGCATCT
AATGATGACATTCTGGTCTCGTCTTCCGTCTCCCCCGTGTTCCCCTCTTGTCTCCCCTGTTCCCCTCTCC 
CTTCCCTCCTCCCATGTCACTGCAGATTGAGACCTACAGGCTGACGTTCCGGGCAAATGCCAGGGCCCGC 
ACCGACCACGGGGCCGACATTGTCTCCCGCCCCCCACACTTCCCTGGCGGCCCCAACTCGGGCTCCCGGG 
TCCTTGGCCCCCTTTCCCGGGCTGTCCACTAGACCAGTGAGCGCTTGGGCGCCGTGCTGGGCAGCCCGCT 

AGGCTCGCCTTCCCTCCTGCTTTGCGTGCCCGGGGCAGCAGCAGCCCTGCCCCACACCTCCTCTCACTCC
CCAGCCTGGGCCCATCTCCCTGCTTTGGTCTTGCCCCATCACTGCGCCACTGCTCCGTGGAGGAGGTTGG 
GAGGGGGTTGGGGTGGTTGAGGCTAAGTTGGGATCTAGGAGAGGAGAACCAGATTCTATCCTCATCTTTT 
TTTGGTTCTTTGGTCCAAACCCAAAAGAAACTGACATGCCCTCCCTTCTCCCTGGATCTACCTGGAGGGA 
AGAGTGGAGGTGGATTCCGAGTGGTGACAGGACGCTGACCGTGGAGCTTAAGCCACTGCCTCTCCCTCTG 

GTCCCACAAATGGGCGCCCCCCCCTCCCCATGCAGGTGGTGTCGGGCCCTTCTTGCTGCCCTGCCCCAAG
TTGGGGGTCAGTGCTGCCTGTCCCCATGCTTAACATACCCGCCTAGCTGCTGTCACATTTTTCTTGTTTT 
GTCCTTTTATTTTTTTCTAATAACCTAAAAACTGGCAAAATAGTTCTGCAGGTTGAAGCCATGTCTACAT 
GAAAGTCCTCAGTAAGTGTTAGAGGGAACAGGGCGGAGATATCCTTATGCCACCCCCGCTGGAGGATGTG 
GGCAGCTTAGGGCCCTGGAGGCGGTGCGGCAGGGAAGAGGGGTGCAGAGGCTGTGGCTGGTGAGCCGGTC 

AGGCACACAAGGGGCCCTTGGAGCGTGGACTGGTTGGTTTTGCCATTTTGTTGTGTGTATGCTGCTTTTC
TTTTCTAACCAAGAGGCTGGTTTTGGCATCTCTGTCCCATTCCCTGGGATCTGGTGGTCAGCCCTAGGAT 
AAAAAGCCAGGGCTGGAGAACAAGAAAGGGCCAGGAGATGAATTC

SEQ ID NO:172

>gi|14195626|ref|NP_112146.1| microtubule-associated protein 4, isoform 2 [Homo sapiens]

MADLSLADALTEPSPDIEGEIKRDFIATLEA^ 

SQIEDTPSSKPTLLANGGHGVEGSDTTGSPTEFLEEKMAYQEYPNSQNWPEDTNFCFQPEQVVDPIQTDP 

FKMYHDDDLADLVFPSSATADTSIFAGQNDPLKDSYGMSPCl^ 

EPPQPTAVPLELAKEIEMASEERPPAQALEIMMGLKTTDMAPSKE
PTKLDVTLAKDMQPSMESDMALVKDMELPT^

LKETERASPIKMDLAPSKDMGPPKENKKETERASPIKMDLAPSm

ANDIISSTEISSAEKVALSSETEVALARDMTLPPET^ 

DVAPSTVKEVGLLKDMSPLSETEMALGKDVTPPPETEVVLIKNVCLPPEM

DGVLTLANNVTPAKDVPPLSETEATPVPIKDMEIAQTQKGISEDS
TGKKCSLPAEEDSVLEKLGERKPCNSQPSELSSETSGIARPEEGRPVVSGTGNDITTPPNKELPPSPEKK

TKPLATTQPAKTSTSKAKTQPTSLPKQPAPTTIGGI^ 

VKPKPIADAKAPEKRASPSKPASAPASRSGSKSTQTVAKTTTAAAVASTGPSSRSPSTLLi 
GKPAEVKKMTAKSVPADLSRPKSTSTSSMKKTTT^ 
AKPSSTTPRLSRLATNTSAPDLKNVRSKV^
GGGNVQIQNKKVDISKVSSKCGSKANIKHKPGGGDVKIESQKLNFKEKAQAKVGSLDNVGHLPAGGAVKT
EGGGSEAPLCPGPPAGEEPAISEAAPEAGAPTSASGLNGHPTLSGGGDQREAQTLDSQIQETSI

SEQ ID NO:173

>gi|14195627|ref|NM_030885.1| Homo sapiens microtubule-associated protein 4 (MAP4), transcript variant 3, mRNA

TGCGACCGCCTCCCTGCGCCCCGCCCCTCCGGCTAGCTCGCTGGCTCCCGGCTCCTCCCGACGTCTCCTA 
CCTCCTCACGGCTCTTCCCGGCGCTCTCCTGGCTCCCTTCTGCCCCAGCTCCGTCTCGGCGGCGGCGGGC 
AGTTGCAGTGGTGCAGAATGGCTGACCTCAGTCTTGCAGATGCATTAACAGAACCATCTCCAGACATTGA 
GGGAGAGATAAAGCGGGACTTCATTGCCACACTAGAGGCAGAGGCCTTTGATGATGTTGTGGGAGAAACT 

GTTGGAAAAACAGACTATATTCCTCTCCTGGATGTTGATGAGAAAACCGGGAACTCAGAGTCAAAGAAGA
AACCGTGCTCAGAAACTAGCCAGATTGAAGATACTCCATCTTCTAAACCAACACTCCTAGCCAATGGTGG 
TCATGGAGTAGAAGGGAGCGATACTACAGGGTCTCCAACTGAATTCCTTGAAGAGAAAATGGCCTACCAG 
GAATACCCAAATAGCCAGAACTGGCCAGAAGATACCAACTTTTGTTTCCAACCTGAGCAAGTGGTCGATC 
CTATCCAGACTGATCCCTTTAAGATGTACCATGATGATGACCTGGCAGATTTGGTCTTTCCCTCCAGTGC 

GACAGCTGATACTTCAATATTTGCAGGACAAAATGATCCCTTGAAAGACAGTTACGGTATGTCTCCCTGC
AACACAGCTGTTGTACCTCAGGGGTGGTCTGTGGAAGCCTTAAACTCTCCACACTCAGAGTCCTTTGTTT 
CCCCAGAGGCTGTTGCAGAACCTCCTCAGCCAACGGCAGTTCCCTTAGAGCTAGCCAAGGAGATAGAAAT 
GGCATCAGAAGAGAGGCCACCAGCACAAGCATTGGAAATAATGATGGGACTGAAGACTACTGACATGGCA 
CCATCTAAAGAAACAGAGATGGCCCTCGCCAAGGACATGGCACTAGCTACAAAAACCGAGGTGGCATTGG 

CTAAAGATATGGAATCACCCACCAAATTAGATGTGACACTGGCCAAGGACATGCAGCCATCCATGGAATC
AGATATGGCCCTAGTCAAGGACATGGAACTACCCACAGAAAAAGAAGTGGCCCTGGTTAAGGATGTCAGA 
TGGCCCACAGAAACAGATGTATCTTCAGCCAAGAATGTGGTACTGCCCACAGAAACAGAGGTAGCCCCAG 
CCAAGGATGTGACACTGTTGAAAGAAACAGAGAGGGCATCTCCTATAAAAATGGACTTAGCCCCTTCCAA 
GGACATGGGACCACCCAAAGAAAACAAGAAAGAAACAGAGAGGGCATCTCCTATAAAAATGGACTTGGCT 

CCTTCCAAGGACATGGGACCACCCAAAGAAAACAAGATAGTCCCAGCCAAGGATTTGGTATTACTCTCAG
AAATAGAGGTGGCACAGGCTAATGACATTATATCATCCACAGAAATATCCTCTGCTGAGAAGGTGGCTTT 
GTCCTCAGAAACAGAGGTAGCCCTGGCCAGGGACATGACACTGCCCCCGGAAACCAACGTGATCTTGACC 
AAGGATAAAGCACTACCTTTAGAAGCAGAGGTGGCCCCAGTCAAGGACATGGCTCAACTCCCAGAAACAG 
AAATAGCCCCGGCCAAGGATGTGGCTCCGTCCACAGTAAAAGAAGTGGGCTTGTTGAAGGACATGTCTCC 

ACTATCAGAAACAGAAATGGCTCTGGGCAAGGATGTGACTCCACCTCCAGAAACAGAAGTAGTTCTCATC
AAGAACGTATGTCTGCCTCCAGAAATGGAGGTGGCCCTGACTGAGGATCAGGTCCCAGCCCTCAAAACAG 
AAGCACCCCTGGCTAAGGATGGGGTTCTGACCCTGGCCAACAATGTGACTCCAGCCAAAGATGTTCCACC 
ACTCTCAGAAACAGAGGCAACACCAGTTCCAATTAAAGACATGGAAATTGCACAAACACAAAAAGGAATA 
AGTGAGGATTCCCATTTAGAATCTCTGCAGGATGTGGGGCAGTCAGCTGCACCTACTTTCATGATTTCAC 

CAGAAACCATCACAGGAACGGGGAAAAAGTGCAGCTTGCCGGCCGAGGAGGATTCTGTGTTAGAAAAACT
AGGGGAAAGGAAACCATGCAACAGTCAACCTTCTGAGCTTTCTTCAGAGACCTCAGGAATAGCCAGGCCA 
GAAGAAGGAAGGCCTGTGGTGAGTGGGACAGGAAATGACATCACCACCCCACCGAACAAGGAGCTCCCAC 
CAAGCCCAGAGAAGAAAACAAAGCCTTTGGCCACCACTCAACCTGCAAAGACTTCAACATCGAAAGCCAA
AACACAGCCCACTTCTCTCCCTAAGCAGCCAGCTCCCACCACCATTGGTGGGTTGAATAAAAAACCCATG 

AGCCTTGCTTCAGGCTTAGTGCCAGCTGCCCCACCCAAACGCCCTGCCGTCGCCTCTGCCAGGCCTTCCA
TCTTACCTTCAAAAGACGTGAAGCCAAAGCCCATTGCAGATGCAAAGGCTCCTGAGAAGCGGGCCTCACC
aaekiertvqieastveie^vklrltnatdtpgygdaincrdcfktiisyideWrylhdesg^ot

IIDNRVHCCFYFISPFGHGIiKPIiDVAFMK^ 
DGDGGALGHHV 

SEQ ID NO:185

Novel (breast cancer antigen; tyrosine/serine/threonine kinase)

cagg^gSS 

agwSccaggSSg^ 
atotcccctotaaggatatgcctgccaccc^ 

ataccagc^catccctgctgaggagctgaccatgagcccttcgcgctcagtggccaactctggcctgagtgaca 
gcctgaccaccagcacccc 

SEQ ID NO:186
caggSggtg^ 

cagSagSotccacgattttcccaagtacagtgtcaaggct 
otccccStSgg^ 

CTGACC 

SEQ ID NO:187

cagaggctca^ 

SgaaSStggcc^^ 

??2SaagnSgtagtttctctcattcaaaangaggggagagaaanctggca 

GGGGGGTGCTG 

SEQ ID NO:188

PAGGGGGTCCTGAAGGCC^ 



CAGCGAGTGGTCCACGATTTTCCCAAGTACAGTGTCAAGGTTCTGCCGTGGCTCAGCCCCGAGGTCCTCCAGCAG 

otcccSSggatatgcctgccacccagatgctgctagagaaactgaacggcacagtgccc 

CTGACCAACAG 



SEQ ID NO:189

>GL3_1.36_B2_M13R Direction: anti-sense

CAGAGGCTCAGAACTCCCAATCGTCCACCTCCAGCTCTTCCAGGTTTGTTACCAGGCCAAAGATTCCACTGTGGT

CCTGAGACTGGCTGCCCTCAAAATTGGTGATGGGGGTGACAGGACGAAGCAATTC 
GTCGCTTGATCTGGGGGAGAAGAGAGAGGTGGGTGACAGATCCTGTTGCTCTGGGTCCCAGGAC^ 

AGGAACCATGGCCTGGTGGCAGACGGGCTGTCGGAGCCAACTCCATGAGAGGAAGGAGCAGTGTCTTTCAGGGGC
TTCGGAAGCGGGGTAGTTTCTCTCATTCAAAAGGAGGGGAGAGAAAGCTG



SEQ ID NO:190

>GL3_1.36_B3_M13F Direction: anti-sense

CAGAGGCTCAGAACTCCCAATCGTCCACCTCCAGCTCT 
CCTGAGACTG^CTGCCCTCA^ 

GTCGCTTGATCTGGGGGAGAAGAGAGAGGTGGGTGACAGATCCTGTTGCTCTGGGTCCCAGGACACCATGGGGC^ 
AGGAACCATGGCCTGGTGGCAGACGGGCTGTCGGAGCCAACTCCATGAGAGGAAGGAGCAGTGTCTTTCAGGGGC
TTCGGAAGCGGGGTAGTTTCTCTCATTCAAAAGGAGGGGAGAGAAAGCTGGCATCCGGGTTGCGCTGAAGGCACT

GCTCCACAAAGTGGTGGAAGTGGGGGGAGAAGGTTCGGTGGTAGGGGTGGGAGGGCGAGTCACCGTNGGAGGGCC 
GGGGGGTGCTGGTGGTCAGGCTGTCACTCAGGCCAGAGTTGGNCACTGATCGCGAANGGCTCATGGTCAGNTCCT
CANCAGGGATGGTGCTGGTATCCAACAGGCAGGGCACTGTGCCGTTCAGTTTCTCTNAGCAGCATCTGGGTGGCA 
GGCATATCCTTAAAGGGGACATGGCCGTTGGCCACTCACAGGCCTGT 

SEQ ID NO:191

>GL3_1.36_B3_M13R Direction: sense

CAGGGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGTGTCAAAGCCAGCCACATC 
CTGATCTCTGTGGATGGGAAGGTCTACCTGTCTGGTTTGCGCAGCAACCTCAGCATGATAAGCCATGGGCAGCGG 
CAGCGAGTGGTCCACGATTTTCCCAAGTACAGTGTCAAGGTTCTGCCGTGGCTCAGCCCCGAGGTCCTCCAGCAG 
AATCTCCAGGGTTATGATGCCAAGTCTGACATCTACAGTGTGGGAATCACAGCCTGTGAACTGGCCAACGGCCAT 
GTCCCCTTTAAGGATATGCCTGCCACCCAGATGCTGCTAGAGAAACTGAACGGCACAGTGCCCTGCCTGTTGGAT 
ACCAGCACCATCCCTGCTGAGGAGCTGACCATGAGCCCTTCGCGCTCAGTGGCCAACTCTGGCCTGAGTGACAGC 
CTGACCACCAG 



>GL3_1.36_B4_M13F Direction: anti-sense
CAGAGGCTCAGAACTCCCAATCGTCCACCTCCAGCTCTTCCAGGTTTGTTACCAGGCCAAAGATTCCACTGTGGT 
CCTGAGACTGGCTGCCCTCAAAATTGGTGATGGGGGTGACAGGACGAAGCAATTCGGGCAAAGCCTCTGAGGCAC 
GTCGCTTGATCTGGGGGAGAAGAGAGAGGTGGGTGACAGATCCTGTNGCTCTGGGTCCCAGGACACCATGGGGCA 
AGGAACCATGGCCTGGTGGCAGANNGACTGTCATAGCCAACNCCATGAGAGGAAGGAGCAGTGTCTTTCAGGGGC 
TTCGGAAGCGGGGTAGTTTCTCTCATTCAAAAGGAGGGGAGAGAAAGCTGGCATCCGGGTTGCGCTGAAGGCACT 
GCTCCACAAAGTGGTGGAAGTGGGGGGAGAAGGTTCGGTGGTAGGGGTGGGAGGGCGAGTCACCGTTGGAGGGCC 
GGGGGGTGCTGGTGGTCAGGCTGTCACTCANGCCAGAGTTGGCCACTGACCGCGAAGGGCTCATGGTCAGTTCCT 
CANCAGGGATGGTGCTGGTA 

SEQ ID NO:193

>GL3_1.36_B4_M13R Direction: sense

CAGGGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGTGTCAAAGCCAGCCACATC
CTGATCTCTGTGGATGGGAAGGTCTACCTGTCTGGTTTGCGCAGCAACCTCAGCATGATAAGCCATGGGCAGCGG 
CAGCGAGTGGTCCACGATTTTCCCAAGTACAGTGTCAAGGTTCTGCCGTGGCTCAGCCCCGAGGTCCTCCAGCAG 
AATCTCCAGGGTTATGATGCCAAGTCTGACATCTACAGTGTGGGAATCACAGCCTGTGAACTGGCCAACGGCCAT 
GTCCCCTTTAAGGATATGCCTGCCACCCAGATGCTGCTAGAGAAACTGAACGGCACAGTGCCCTGCCTGTTGGAT 
ACCAGCACCATCCCTGCTGAGGAGCTGACCATGAGCCCTTCGCGCTCAGTGGCCAACTCTGGCCTGAGTGACAGC 
CTGACCACCAGCACCCCCC 

SEQ ID NO:194

>GL3_10.04_PCR_G3F1 Direction: N/A

CAGGGGGTGCTGAAGGCCCTCGACTACGTCCACCACATGGGATATGTACACAGGAGTGTCAAAGCCAGCCACATC 
CTGATCTCTGTGGATGGGAAGGTCTACCTGTCTGGTTTGCGCAGCAACCTCAGCATGATAAGCCATGGGCAGCGG 
CAGCGAGTGGTCCACGATTTNCCCAAGTACAGTGTCAAGGTTCTGCCGTGGCTCAGCCCCGAGGTCCTCCAGCAG 
AATCTCCAGGGTTATGATGCCAAGTCTGACATCTACAGTGTGGGAATCACAGCCTGTGAACTGGCCAACGGNCAT 
GTCCCCTTTAAGGATATGCCTGCCACCCATATGCTGCTAGAGAAACTGAACGGCACAGTGCCCTG

SEQ ID NO:195

>GL3_3.29_A_M13F Direction: N/A

CAGGGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGTGTCAAAGCCAGCCACATC 
CTGATCTCTGTGGATGGGAAGGTCTACCTGTCTGGTTTGCGCAGCAACCTCAGCATGATAAGCCATGGGCAGCGG 
CAGCGAGTGGTCCACGATTTTCCCAAGTACAGTGTCAAGGTTCTGCCGTGGCTCAGCCCCGAGGTCCTCCAGCAG 
AATCTCCAGGGTTATGATGCCAAGTCTGACATCTACAGTGTGGGAATCACAGCCTGTGAACTGGCCAACGGCCAT 
GTCCCCTTTAAGGATATGCCTGCCACCCAGATGCTGCTAGAGAAACTGAACGGCACAGTGCCCTGCCTGTTGGAT 
ACCAGCACCATCCCTGCTGAGGAGCTGACCATGAGCCCTTCGCGCTCAGTGGCCAACTCTGGCCTGAGTGACAGC 
CTGACCACCAGCACCC 

SEQ ID NO:196

>GL3_3.29_A_M13R Direction: anti-sense

CAGAGGCTCAGAACTCCCAATCGTCCACCTCCAGCTCTTCCAGGTTTGTTACCAGGCCAAAGATTCCACTGTGGT
CCTGAGACTGGCTGCCCTCAAAATTGGTGATGGGGGTGACAGGACGAAGCAATTCGGGCAAAGCCTCTGAGGCAC 



SEQ ID NO:192
GTCGCTTGATCTGGGGGAGAAGAGAGAGGTGGGTGACAGATCCTGTTGCTCTGGGTCCCAGGACACCATGGGGCA
AGGAACCATGGCCTGGTGGCAGACGGGCTGTCGGAGCCAACTCCATGAGAGGAAGGAGCAGTGTCTTTCAGGGGC 
TTCGGAAGCGGGGTAGTTTCTCTCATTCAAAAGGAGGGGAGAGAAAGCTGGCATCCGGGTTGCGCTGAAGGCACT 
GCTCCACAAAGTGGTGGAAGTGGGGGGAGAAGGTTCGGTGGTAGGGGTGGGAGGGCGAGTCACCGTTGGAGGGCC 
GGGGGGTGCTGGTGGTCAGGCTGTCACTCAGGCCAGAGTTGGCCACTGAGCGCGAAGGGCTCATGGTCAGCTCCT
CAGCAGGGATGGTGCTGGTATCCAACAGGCAGGGCACTGTGCCGTTCAGTTTCTCTAGCAGCATCTGGGTGGCAG 
GCATATCCTTAAAGGGGACATGGCCGTTGGCCAGTTCACAGGCTGTGATTCCCACACT



>GL3_3.29_B_M13F Direction: N/A

CAGGGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGTGTCAAAGCCAGCCACATC 
CTGATCTCTGTGGATGGGAAGGTCTACCTGTCTGGTTTGCGCAGCAACCTCAGCATGATAAGCCATGGGCAGCGG 
CAGCGAGTGGTCCACGATTTTCCCAAGTACAGTGTCAAGGTTCTGCCGTGGCTCAGCCCCGAGGTCCTCCAGCAG 
AATCTCCAGGGTTATGATGCCAAGTCTGACATCTACAGTGTGGGAATCACAGCCTGTGAACTGGCCAACGGCCAT 
GTCCCCTTTAAGGATATGCCTGCCGCCCAGATGCTGCTAGAGAAACTGAACGGCACAGTGCCCTGCCTGTTGGAT
ACCAGCACCATCCCTGCTGAGGAGCTGACCATGAGCCCTTCGCGCTCAGTGGCCAACTCTGGCCTGAGTGACAGC 
CTGACCACCAGCACCC 

SEQ ID NO:198

>GL3_3.29_B_M13R Direction: anti-sense

CAGAGGCTCAGAACTCCCAATCGTCCACCTCCAGCTCTTCCAGGTTTGTTACCAGGCCAAAGATTCCACTGTGGT 
CCTGAGACTGGCTGCCCTCAAAATTGGTGATGGGGGTGACAGGACGAAGCAATTCGGGCAAAGCCTCTGAGGCAC 
GTCGCTTGATCTGGGGGAGAAGAGAGAGGTGGGTGACAGATCCTGTTGCTCTGGGTCCCAGGACACCATGGGGCA 
AGGAACGATGGCCTGGTGGCAGACGGGCTGTCGGAGCCAACTCCATGAGAGGAAGGAGCAGTGTCTTTCAGGGGC 

TTCGGAAGCGGGGTAGTTTCTCTCATTCAAAAGGAGGGGAGAGAAAGCTGGCATCCGGGTTGCGCTGAAGGCACT
GCTCCACAAAGTGGTGGAAGTGGGGGGAGAAGGTTCGGTGGTA 

SEQ ID NO:199

>GL3_3.29_C_M13F Direction: N/A

CAGGGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGTGTCAAAGCCAGCCACATC 
CTGATCTCTGTGGATGGGAAGGTCTACCTGTCTGGTTTGCGCAGCAACCTCAGCATGATAAGCCATGGGCAGCGG 
CAGCGAGTGGTCCACGATTTTCCCAAGTACAGTGTCAAGGTTCTGCCGTGGCTCAGCCCCGAGGTCCTCCAGCAG 
AATCTCCAGGGTTATGATGCCAAGTCTGACATCTACAGTGTGGGAATCACAGCCTGTGAACTGGCCAACGGCCAT 
GTCCCCTTTAAGGATATGCCTGCCACCCAGATGCTGCTAGAGAAACTGAACGGCACAGTGCCCTGCCTGTTGGAT 
ACCAGCACCATCCCTGCTGAGGAGCTGACCATGAGCCCTTCGCGCTCAGTGGCCAACTCTGGCCTGAGTGACAGC
CTGACCACCAGCACC 

SEQ ID NO:200 

>GL3_3.29_C_M13R Direction: anti-sense

CAGAGGCTCAGAACTCCCAATCGTCCACCTCCAGCTCTTCCAGGTTTGTTACCAGGCCAAAGATTCCACTGTGGT
CCTGAGACTGGCTGCCCTCAAAATTGGTGATGGGGGTGACAGGACGAAGCAATTCGGGCAAAGCCTCTGAGGCAC 
GTCGCTTGATCTGGGGGAGAAGAGAGAGGTGGGTGACAGATCCTGTTGCTCTGGGTCCCAGGACACCATGGGGCA 
AGGAACCATGGCCTGGTGGCAGACGGGCTGTCGGAGCCAACTCCATGAGAGGAAGGAGCAGTGTCTTTCAGGGGC 
TTCGGAAGCGGGGTAGTTTCTCTCATTCAAAAGGAGGGGAGAGAAAGCTGGCATCCGGGTTGCGCTGAAGGCACT 

GCTCCACAAAGTGGTGGAAGTGGGGGGAGAAGGTTCGGTGGTAGGGGTGGGAGGGCGAGTCACCGTTGGAGGGCC
GGGGGGTGCTGGTGGTCAGGCTGTCACTCAGGCCAGAGTTGGCCACTGAGCGCGAAGGGCTCATGGTCAGCTCCT 
CAGCAGGGATGGTGCTGGTATCCAACAGGCAGGGCACTGTGCCGTTCA 

SEQ ID NO:201 

>GL3_3.29_D_M13F Direction: N/A

CAGGGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGTGTCAAANCCAGNCACATC 
CTGATCTCTGTGGATGGGAAGGTCTACCTGTCTGGTTTGCGCAGCAACCTCAGCATGATAAGCCATGGGCAGCGG 
CAGCNAGTGGTCCANGANN 

SEQ ID NO:202

>gi|20559344|ref|XM_044379.3| Homo sapiens similar to serologically defined breast cancer antigen NY-BR-96 (LOC92335), mRNA

GCGGCGCGGCAGTAAAACTGAGGAGGCGGAGCCAAGACGGTCGGGGCTGCTTGCTAACTCCAGGAACAGG 
TTTAAGTTTTTGAAACTGAAGTAGGCCTACACAGTAGGAACTCATGTCATTTCTTGTAAGTAAACCAGAG 
CGAATCAGGCGGTGGGTCTCGGAAAAGTTCATTGTTGAGGGCTTAAGAGATTTGGAACTATTTGGAGAGC



SEQ ID NO:197
AGCCTCCGGGTGACACTCGGAGAAAAACCAATGATGCGAGCTCAGAGTCAATAGCATCCTTCTCTAAACA
GGAGGTCATGAGTAGCTTTCTGCCAGAGGGAGGGTGTTACGAGCTGCTCACTGTGATAGGCAAAGGATTT 
GAGGACCTGATGACTGTGAATCTAGCAAGGTACAAACCAACAGGAGAGTACGTGACTGTACGGAGGATTA 
ACCTAGAAGCTTGTTCCAATGAGATGGTAACATTCTTGCAGGGCGAGCTGCATGTCTCCAAACTCTTCAA 
CCATCCCAATATCGTGCCATATCGAGCCACTTTTATTGCAGACAATGAGCTGTGGGTTGTCACATCATTC
ATGGCATACGGTTCTGCAAAAGATCTCATCTGTACACACTTCATGGATGGCATGAATGAGCTGGCGATTG 
CTTACATCCTGCAGGGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGTGT 
CAAAGCCAGCCACATCCTGATCTCTGTGGATGGGAAGGTCTACCTGTCTGGTTTGCGCAGCAACCTCAGC 
ATGATAAGCCATGGGCAGCGGCAGCGAGTGGTCCACGATTTTCCCAAGTACAGTGTCAAGGTTCTGCCGT 

GGCTCAGCCCCGAGGTCCTCCAGCAGAATCTCCAGGGTTATGATGCCAAGTCTGACATCTACAGTGTGGG
AATCACAGCCTGTGAACTGGCCAACGGCCATGTCCCCTTTAAGGATATGCCTGCCACCCAGATGCTGCTA 
GAGAAACTGAACGGCACAGTGCCCTGCCTGTTGGATACCAGCACCATCCCCGCTGAGGAGCTGACCATGA 
GCCCTTCGCGCTCAGTGGCCAACTCTGGCCTGAGTGACAGCCTGACCACCAGCACCCCCCGGCCCTCCAA 
CGGTGACTCGCCCTCCCACCCCTACCACCGAACCTTCTCCCCCCACTTCCACCACTTTGTGGAGCAGTGC

CTTCAGCGCAACCCGGATGCCAGGCCCAGTGCCAGCACCCTCCTGAACCACTCTTTCTTCAAGCAGATCA
AGCGACGTGCCTCAGAGGCTTTGCCCGAATTGCTTCGTCCTGTCACCCCCATCACCAATTTTGAGGGCAG 
CCAGTCTCAGGACCACAGTGGAATCTTTGGCCTGGTAACAAACCTGGAAGAGCTGGAGGTGGACGATTGG 
GAGTTCTGAGCCTCTGCAAACTGTGCGCATTCTCCAGCCAGGGATGCAGAGGCCACCCAGAGGCCCTTCC 
TGAGGGCCGGCCACATTCCCGCCCTCCTGGGCAGATTGGGTAGAAAGGACATTCTTCCAGGAAAGTTGAC 

TGCTGACTGATTGGGAAAGAAAATCCTGGAGAGACACTTCACTGCTCCAAGGCTTTTGAGACACAAGGGA
ATCTCAACAACCAGGGATCAGGAGGGTCCAAAGCCGACATTCCCAGTCCTGTGAGCTCAGGTGACCTCCT 
CCGCAGAAGAGAGATGCTGCTCTGGCCCTGGGAGCTGAATTCCAAGCCCAGGGTTTGGCTCCTTAAACCC 
GAGGACCGCCACCTCTTCCCAGTGCTTGCGACCAGCCTCATTCTATTTAACTTTGCTCTCAGATGCCTCA 
GATGCTATAGGTCAGTGAAAGGGCAAGTAGTAAGCTGCCTGCCTCCCTTCCCTCAGACCTCTCCCTCATA 

ATTCCAGAGAAGGGCATTTCTGTCTTTTTAAGCACAGACTAAGGCTGGAACAGTCCATCCTTATCCCTCT
TCTGGCTTGGGCCCTGACACCTAAGTCTTTCCCACGGTTTATGTGTGTGCCTCATTCCTTTCCCACCAAG 
AATCCATCTTAGCGCCTCCTGCCAGCTGCCCTGGTGCTTTCTCCAAGGGCCATCAGTGTCTTGCCTAGCT 
TGAGGGCTTAAGTCCTTATGCTGTGTTAGTTTCGTTGTCAGAA 

SEQ ID NO:203

>gi|14770985|ref|XP_044379.1| similar to serologically defined breast cancer antigen NY-BR-96 [Homo sapiens]

MSFLVSKPERIRRWVSEKFIVEGLRDLELFGEQPPGDTRRKTNDASSESIASFSKQEVMSSFLPEGGCYE
LLTVIGKGFEDLMTVNLARYKPTG
NELWVVTSFMAYGSAKDLICTH^

LSGLRSNLSMISHGQRQRVVHDFPKYSVKVLPWLSPEVLQQN^

DMPATQMLLEKLNGTVPCLLDTSTIPAEELTMSPSRSVANSGLSDSLTTSTPRPSNGDSPSHPYHRTFSP 

HFHHFVEQCLQRNPDARPSASTLLNHSFFKQIK^ 

LEELEVDDWEF 


SEQ ID NO:204 

Novel (chromosome 1) 

>GL2_46A_B12_G3F1 Direction: sense

ATCTACATGAGGTCCTGTAGATTGAGCAACTTTGGCTATTTGGGTAGCTGATCCACTGCCAAGCTGGCTCCCAGC 
TACCACCTGTTACTTTGACAACTCTAGCTTCAGGGCTACCAGGACCATTTGCAAAACTTACACCCACAGAAAACT
CAAAGTCTCCAGAGGCATAAGCTCCAATCCCCCAGAGAGCTCGCACTAGCATTCTTCTCCCTTTCTAGTAGTACT 
TTTATGGGGGGGAGTTATTCTTCCCCACAAAAGTGAGCGGCGACAATAAAAACAATGTTTCCCACAAAGAGTTTT 
CCCCGGTTCTTCTCCCAAAAATTTNGGGGGGGCCCCCGTTTNTCCCCCGGGCGACGGGGGCGGGNGGGCCCGCCC 
CCCAAAACCACAANGGNGGTGTGTTCCCTCTGAAAGCTCNNACAAAGAGGGGTGACCCCCCCCGGTTTTGGGTGA 
TGAATACCGCCCGAAGATAGGNCGCCCGAAAAAGAGCCAACCCl^GAAAAGAGAGATTTTTATATAGAGGAAGCA
AAAACGAAANGANAGCGGTTTTTTTTTTTTTGTCCNGCCAACAACATGGGGGGCCCCCGTTTTATAAATAGAGTT 
TTTTGTTTTTTAAACAATCGACTTT^^ 

GGGGGNGCGTGTATTATATAAATTATTTAGTGGGGTGCGGGGANGGAAAGAGNGAAGCTGTTTATCCCATCAAAA 
TATTATTGTG 


SEQ ID NO:205

>GL2-30-CON-TOPO Direction: sense

ATCTACATGAGGTCCTGTAGATTGAGCAACTTTGGCTATTTGGGTAGCTGATCCACTGCCAAGCTGGCTCCCAGC 
TACCACCTGTTACTTTGACAACTCTAGCTTCAGGGCTACCAGGACCATTTGCAAAACTTACACCCACAGAAAACT 
CAAAGTCTCCAGAGGCATAAGCTCCAACCCCCAGAGAGCTCCACAGCATTCTTCTCCTTCTAGTAGTACTTTTGG

TTTCCAGGAAAATCATCTCATGCTCGGGCCACAGTCACAGTCCTGACC^

CTAGTTGTATGGAGTCATTCAAAATACTAGTTCCGAAAGAAGTTTCAAAGGTCCAACGCCGAGCATCTTTTACCT
TTGCTAATTTTTCCATAAATTGAAAGCCTT 



SEQ ID NO:206

>GL2_46A_B07_G3F1 Direction: N/A

ATCTACATGAGGTCCTGTAGATTGAGCAACTTTGGCTATTTGGGTAGCTGATCCACTGCCAAGCTGGCTCCCAGC 
TACCACCTGTTACTTTGACAACTCTAGCTTCAGGGGGCTACCAGGACCATTTGCAAAACTTACACCCACAGAAAA 
CTCAAAGTCTCCAGAGGCATAAGCTCCAACCCCCAGAGAGCTCCACAGCATTCTTCTCCTTCTAGTAGTACTTTT 
GGTTTCCAGGAAAATCATCTCATGCTCGGGCCACAGTCACAGTCCTGACCAGCACCATAGAAATTTCAGCCATTA
TTCTAGTTGTATGGAGTCATTCAAAATACTAGTTCCGAAAGAAGTTTCAAAGGTCCAACGCCGAGCATTCTTTTT 
ACCTTTGCTAATTTTTC 



SEQ ID NO:207 
>GL2_46A_B08_G3F1 Direction: sense

ATCTACATGAGGTCCTGTAGATTGAGCAACTTTGGCTATTTGGGTAGCTGATCCACTGCCAAGCTGGCTCCCAGC 
TACCACCTGTTACTTTGACAACTCTAGCTTCAGGGCTACCAGGACCATTTGCAAAACTTACACCCACAGAAAACT 
CAAAGTCTCCAGAGGCATAAGCTCCAACCCCCAGAGAGCTCCACAGCATTCTTCTCCTTCTAGTAGTACTTTTGG 
TTTCCAGGAAAATCATCTCATGCTCGGGCCACAGTCACAGTCCTGACCAGCACCATAGAAATTTCAGCCATT 


SEQ ID NO:208 

>GL2_46A_B10_G3F1 Direction: sense

ATCTACATGAGGTCCTGTAGATTGAGCAACTTTGGCTATTTGGGTAGCTGATCCACTGCCAAGCTGGCTCCCAGC 
TACCACCTGTTACTTTGACAACTCTAGCTTCAGGGGCTACCAGGACCATTTGCAAAACTTACACCCACAGAAAAC 
TCAAAGTCTCCAGAGGCATAAGCTCCAACCCCCAGAGAGCTCCACAGCATTCTTCTCCTTCTAGTAGTACTTTTG
GTTTCCAGGAAAATCATCTCATGCTCGGGCCACAGTCACAGTCCTGACCAGCACCATAGAAATTTCAGCCATTAT 
TCTAGTTGTATGGAGTCATTCAAAATACTAGTTCCGAAAGAAGTTTCAAAGGTCCAACGCCGAGCATCTTTTACC 
TTTGCTAATTTTTCCATAAATTGAAAGCCTT 



SEQ ID NO:209

>GL2_46A_B11_G3F1 Direction: N/A 

AATCTACATGAGGTCCTGTAGATTGAGCAACTTTGGCTATTTGGGTAGCTGATCCACTGCCAAGCTGGCTCCCAG 
CTACCACCTGTTACTTTGACAACTCTAGCTTCAGGCTACAGGACATTTGCAAAACTTACACCCACAGAAAACTCA 
AAGTCTCCAGAAGCATAAGCTCCAAACCCCAGANAGCTCCAAGATTCTTCTCCTTCTAGTAGTACTTTTGGTTTC 
CAGGAAAATCATCTCATGCTCGGGCCACAGTCACAGTCCTGACCAGCACCATAGAAATTTCAGCATTATTCTAGT
TGTATGTGAGTCCTTCCAAATACTAGTTCCGAAAGAAGTTTCCAAGGTCCAACGCCGAGCATCTTTTACNTTGGT 
AATTTTTCCCTAAATTGAAAGCCTTCCGCAGAAACCAGCACAGTGGTTAGATAGATAAA 



SEQ ID NO:210

Clone 30 - Novel (Chromosome 1) >

ATCTACATGAGGTCCTGTAGATTGAGCAACTTTGGCTATTTGGGTAGCTGATCCACTGCCAAGCTGGCTCCCAGC 
TACCACCTGTTACTTTGACAACTCTAGCTTCAGGGCTACCAGGACCATTTGCAAAACTTACACCCACAGAAAACT 
CAAAGTCTCCAGAGGCATAAGCTCCAACCCCCAGAGAGCTCCACAGCATTCTTCTCCTTCTAGTAGTACTTTTGG 
TTTCCAGGAAAATCATCTCATGCTCGGGCCACAGTCACAGTCCTGACCAGCACCATAGAAATTTCAGCCATTATT 

CTAGTTGTATGGAGTCATTCAAAATACTAGTTCCGAAAGAAGTTTCAAAGGTCCAACGCCGAGCATCTTTTACCT
TTGCTAATTTTTCCATAAATTGAAAGCCTT 



SEQ ID NO:211

Novel (chromosome 3; h40)

>GL2_46B_C03_G3F1 Direction: N/A

GCGTCGCTCGGCGTTAGCCAAGGCCCGGGCGGCGCCACCCTCCGGGGGCACTAGGTCTGGGGCCGCAGTGCCCCC 
CCATAGGCACACACGAGNGAAAGCGCCAATGGGCCTGGGTTTTTTTTAAATTTCCCGGGCGGGGCGNAACCCCGG 
AACACAACGGGCAAACTTGTACAATTTTTGTGGGGANCAACAACCGGGACACAANTTTTTGGGGNAATGGCGCCA 
CAAAAAAANAAACCAACGACGCGACGAAAGCGCGTGGTGTTTGTGCNGAATACTTTTTATAAAACCACCAAATGG 

GCGCCTCTCGCTTGTGCCGCCCCCTTCTCCCCAAANGGGGGGCGGCGCTGTTTTTTCCCCAACACGGGCGTGGGG
CCCAAAAAAAATTGTGGGCGCCCAAAGAAAAAATATTTTAATAAAACAGAGGCGGCCCCCCACAACACAGGCGGT 
TNTGGGGGAACAGGAAACCAACACAACGACAACACGGCGCTGGCGGCAACAAGCAAACACGGGAGCAAACAAAAC 
ACGGGCGGAAAGAAACCACACGGGCGAAACGACTTTTATAACAAACACTTTTGTTGGGGCGCAAACAGCACACAC 
ACAACCAAAATGGGCGCCCCCCCCCNCCGAGGGGGCCCACAAAGGNCGAAGAGAAAAAACCACCCCCCCCCAAAA 

AGAGAGCGCCCGCAAAACAACCACCAAAAAAAGGGGAGGAGTTTGGAGAGCAGGGCGAGANGTTTTTTTATATTA



AAACACGGGCNACGAAAACATCATTTTATAAATAGGGCAGGCAAAAAACACGATTTATiUVTATTTCAACAGAAAA 
CAGGGNTGGCCTCTGGGGGTTTGGTGTGTGTGCTTCCCNACCGGTGGTGCGCGTGTTTTTCCTCCTGGTGNGGAA 
AAAAGACACCTGTTTTATAAAAGAGGAGNGTATTNCCNCCTTTTTGTGGGTGCGAAAAACAGACGNGAGGCGGGA 
NGATNTATCCCACTCTTTTGGTGGTGCAACAAGTGTTTTATATAGNATACCTG 



SEQ ID NO:212
>GL2_46B_C01_G3F1 Direction: N/A

GCGTTGCTCGGCGTTAGCCAAGGGCCCGGGCGGCGCCACCCTCCGGGGGCACTAGGTCTGGGGCCGCAGTGCCCA 
GNC^C^GAGCAGCGTTTATCTGGGACGAAGATGAATGGAAAGAATTGGAGCAAAAAGAGGTTGATTACAGCGGGC 
TCAMGGGTTCAGGCGAATGCAAATAAGCACGTGAAAAGGAAGAAGACGATAATGAAAAGCCGCAGAAACCAGCAC
AGTGGTTAGAT 

SEQ ID NO:213

>GL2_46B_C02_G3F1 Direction: sense 
CAAGCCGGTGCAGCGGTTGTCAGTGTGCTGGGTCTGCGGGCGTTGCTTCGGCGTTAGCCAAGCCGGGCGGCGCCA
CCCTCCGGGGGGGGGGCCACTAGGTCTGGGGCCGCAGTGCCCAGCACAGAGCAGCGGTTTATCGGGACGAAGAAT 
GAATTGGAAAGAATTGGAGCAAAAAGAGGTTGATAACAGCGGCCTCAGGGGTTCAGGCAATGCAAATAAGCAGTG 
AAAAGGAAGAAGACGATAATGAAAAGCCGCAGAAACCAGCACAGTGGTTAGATAGATAAAGCGGGCGCTCGACTA 
GTCTGAGGTCTGATA 


SEQ ID NO:214

>GL2_46B_C04_G3F1 Direction: N/A

GCGTGCTCGGCGTTAGCCAATGCCCGGGCGGCGCCACCTCCGGGGGCACTAGGTCTTGGGGCCGCAGTGCCCAGC 
ACAGAGCAGCGTTTATCGGGACGAAGATGAATGGAAAGAATTGGAGCAAAAAGAGGTTGATTACAGCTGGCCTCA 
GGGGGTTCAGGCAATGCAAATAAGCAGTGAAAAGGAAGAAGAC

SEQ ID NO:215

>GL2_46B_C05_G3F1 Direction: sense

GCGTCGCTCGGCGTTAGCCAAGGCCCGGGCGGCGCCACCCTCCGGGGGCACTAGGTCTGGGGCCGCAGTGCCCAG 
CACAGAGCAGCGTTTATCGGGACGAAGATGAATGGAAAGAATTGGAGCAAAAAGAGGTTGATTACAGCGGCTCAG
GGGTTCAGGCAATGCAAATAAGCAGTGAAAAGGAAGAAGACGATAATGAAAAG 

SEQ ID NO:216

>GL2_46B_C06_G3F1 Direction: sense 
GCGTCGCTCGGCGTTAGCCAAGGCCCGGGCGGCGCCACCCTCCGGGGGCACTAGGTCTGGGGCCGCAGTGCCCAG
CACAGAGCAGCGTTTATCGGGACGAAGATGAATGGAAAGAATTGGAGCAAAAAGAGGTTGATTACAGCGGCCTCA 
GGGGTTCAGGCAATGCAAATAAGCAGTGAAAAGGAAGAAGACGATAATGAAAAG 

SEQ ID NO:217
>GL2_95_1_M13R Direction: anti-sense

CTTTTCATTATCGTCTTCTTCCTTTTCACTGCTTATTTGCATTGCCTGAACCCTGAGGCCGCTGTAATCAACCTC 
TTTTTGCTCCAATTCTTTCCATTCATCTTCGTCCCGATAAACGCTGCTCTGTGCTGGGCACTGCGGCCCCAGACC 
TAGTGCCCCCGGAGGGTGGCGCCGCCCGGGCCTCGGCTAACGCCGAGCGACGC 

SEQ ID NO:218

Clone 95 -Novel (Chromosome 3/ H41) > 

GCGTCGCTCGGCGTTAGCCAAGGCCCGGGCGGCGCCACCCTCCGGGGGCACTAGGTCTGGGGCCGCAGTGCCCAG 
CACAGAGCAGCGTTTATCGGGACGAAGATGAATGGAAAGAATTGGAGCAAAAAGAGGTTGATTACAGCGGCCTCA 
GGGGTTCAGGCAATGCAAATAAGCAGTGAAAAGGAAGAAGACGATAATGAAAAG 


SEQ ID NO:219

Novel (chromosome 17) 

>GL2_147_1_M13F Direction: anti-sense

CTTGCCATTCCACTCCAGCCTGGGCAACGAGAGCGAAACTCTGTCTCAAAAAGAAAAGAAAAGGAAAGAAAAACC 
ACCTCCCACGkGGTTAAATAAATAATTAACATTTTC

ATAAATATATGATAGGATATTTTACATTTGATGTATCCTGAAGATTAATTCAACAAATGTTTAGTGAGTAGG 

GTTGTAGGTGCTGGGAATTCAGCAGTGAACAAAACAAAGTCTCTGCCTCATGGAGCTCATATTATAGTAGGGAAG

ACTGTCAATAAGCAATATACACATTGACTAAATGATAGA^ 

AAATCAGGATAAAAGGATAAGAGTGATATGGGGATTCTCTTAGTTAGGGTAGATAAGGAAGGCCTCTTAGCTAAG
GAGGCACATAAATATCTGATCATCTAATGAGAGAAGGAGCAACATGGGATTTCTAGAGGGAAACTATCCTAGAAA
GA 

SEQ ID NO:220 
>GL2_147_2_M13F Direction: sense

CTGCTTCTCTTCTAGGATAGTTTCCCTCTAGAAATCCATGTTGCTCCTTCTCTCATTAAATGATCAGATATTTAT 
GTGCCTCCTTAGCTAAGAGGCCTTCCTTATCTACCCTAACTAAGAGAATCCCCATATCACTCTTATCCTTTTATC 
CTGATTTATTTTTCCATAAAACTTGCCACATTACTGGGCATTATCTATCATTTAGTAATGTGTATATTGCTTATT 
GACAGTCTTCCCTACTATAATATGAGCTCCATGAGGCAGAGACTTTGTTTTGTTCACTGCTGAATTCCCAGCACC 
TACAACAAGCCTACTCACTAAACATTTGTTGAATTAATCTTCAGGATACATCAAATGTAAAAATATCCTATCATA
TATTTATTGATGTGATGCATAGGAAAATGTCTGGAAGGGTACATACCAAAATGTTAATTATTTATTTAACCTGGT 
GGGAGGTGGATTATTTCTTTCCTTTTCTTTCTTTTTGAGACCAGAGTTTCCGCATCTTCGTTGCCCCAGGCTGGG 
AGTGGAAATTGGGCAAAGCCCGCAGAAACCCCAAGCAACAAGTGGGTTTAGATAGNATCAAAGCGGGCCGCTCGA 
CTTAANTCCTTGAAGGTCTGATTACTCAACCTGACTTGTCAGAAA 


SEQ ID NO:221

>GL2_147_2_M13R Direction: anti-sense

CTTGCCATTCCACTCCAGCCTGGGCAACGAGAGCGAAACTCTGTCTCAAAAAGAAAAGAAAAGGAAAGAAAAACC 
ACCTCCCACCAGGTTAAATAAATAATTAACATTTTGGTATGTACCCTTCCAGACATTTTCCTATGCATCACATCA 

ATAAATATATGATAGGATATTTTACATTTGATGTATCCTGAAGATTAATTCAACAAATGTTTAGTGAGTAGGCTT
GTTGTAANGTGCTGGGAATTCAGCAGTGAACAAAACAAAGTCTCTGCCTCATGGAGCTCATATTATAGTAGGGAA 
GACTGTCAATAAGCAATATACACATTACTAAATGATAGATAATGCCCAGTAATGTGGCAAGTTTTATGGAAAAAT 
TAATCAGGATAAAGGATAAGAGTGATATGGGGATTCTCTTAATTAGGGTAGATNAGGAAGGGCTCTTTAGCTNAA 
GGAAGCAACATAAATATCTGATCATTTTAAAANGGGAAGGAACGAAAAAGGGGAAAGCCAAAACACAAATGGGGG 

ACAATTTATACCTTACGAAAAGGGGGAAAAAACCATAAAAT
CCAGCAAAAACAA 

SEQ ID NO:222 

>GL2_147_3_M13F Direction: anti-sense

CTTGCCATTCCACTCCAGCCTGGGCAACGAGAGCGAAACTCTGTCTCAAAAAGAAAAGAAAAGGAAAGAAAAACC
ACCTCCCACCAGGTTAAATAAATAATTAACATTTTGGTATGTACCCTTCCAGACATTTTCCTATGCATCACATCA 
ATAAATATATGATAGGATATTTTACATTTGATGTATCCTGAAGATTAATTCAACAAATGTTTAGTGAGTAGGCTT 
GTTGTAGGTGCTGGGAATTCAGCAGTGAACAAAACAAAGTCTCTGCCTCATGGAGCTCATATTATAGTAGGGAAG 
ACTGTCAATAAGCAATATACACATTACTAAATGATAGATAATGCCCAGTAATGTGGCAAGTTTATATGGAAAAAT 

AAATCAGGATAAAAGGATAAGAGTGATATGGGGATTCTCT

SEQ ID NO:223 

>GL2_147_3_M13R Direction: sense

CTGCTTCTCTTCTAGGATAGTTTCCCTCTAGAAATCCATGTTGCTCCTTCTCTCATTAAATGATCAGATATTTAT 
GTGCCTCCTTAGCTAAGAGGCCTTCCTTATCTACCCTAACTAAGAGAATCCCCATATCACTCTTATCCTTTTATC
CTGATTTATTTTTCCATAAAACTTGCCACATTACTGGGCATTATCTATCATTTAGTAATGTGTATATTGCTTATT
GACAGTCTTCCCTACTATAATATGAGCTCCATGAGGCAGAGACTTTGTTTTGTTCACTGCTGAATTCCCAGCACC 
TACAACAAGCCTACTCACTAAACATTTGTTGAATTAATCTTCAGGATACATCAAATGTAAAATATCCTATCATAT 
ATTTATTGATGTGATGCATAGGAAAATGTCTGGAAGGGTACATACCAAAATGTTAATTATTTATTTAACCTGGTG 
GGAGGTGGTTTTTCTTTCCTTTTCTTTTCTTTTTGAGACAGAGTTTACGCTCTCGTGTGCCCAGGCCTGGAGTGG
AATGGCAAGCCGCAGAANCCAGCACAGTGGTTANGATNAATTAAAGCGGGCNAGTCCAATAAGTCTGAAGGCTGA 
AACACACACACAGAGACGGTCACAAAGGGCGAAATNCGGCAAAATNCCATCCCACTGGGGGCCGCATCACAAAAT 
GltfATNGAGAAGGGGACCAAATGGCCAAAGANGAGCGAT 

SEQ ID NO:224

>GL2_147_4_M13F Direction: sense

CTGCTTCTCTTCTAGGATAGTTTCCCTCTAGAAATCCATGTTGCTCCTTCTCTCATTAAATGATCAGATATTTAT 
GTGCCTCCTTAGCTAAGAGGCCTTCCTTATCTACCCTAACTAAGAGAATCCCCATATGACTCTTATCCTTTTATC 
CTGATTTATTTTTCCATAAAACTTGCCACATTACTGGGCATTATCTATCATTTAGTAATGTGTATATTGCTTATT 
GACAGTCTTCCCTACTATAATTATGAGCTCCATGAGGCAGAGACTTTGTTTTGTTCACTGCTGAATTCCCAGCAC
CTACAACAAGCCTACTCACTAAACATTTGTTGAATTAATCTTCAGGATACATCAAATGTAAAATATCCTATCATA 
TATTTATTGATGTGATGCATAGGAAAATGTCTGGAAGGGTACATACCAAAATGTTAATTATTTATTTAACCTGGT 
GGGAGGTGGTAA 
SEQ ID NO:225

>GL2_147_1_M13R Direction: sense

CTGCTTCTCTTCTAGGATAGTTTCCCTCTAGAAATCCATGTTGCTCCTTCTCTCATTAGATGATCAGATATTTAT 
GTGCCTCCTTAGCTAAGAGGCCTTCCTTATCTACCCTAACTAAGAGAATCCCCATATCACTCTTATCCTTTTATC 
CTGATTTATTTTTCCATAAAACTTGCCACATTACTGGGCATTATCTATCATTTAGTAATGTGTATATTGCTTATT
GACAGTCTTCCCTACTATAATATGAGCTCCATGAGGCAGAGACTTTGTTTTGTTCACTGCTGAATTCCCAGCACC 
TACAACAAGCCTACTCACTAAACATTTGTTGAATTAATCTTCAGGATACATCAAATGTAAAATATCCTATCATAT 
ATTTATTGATGTGATGCATAGGAAAATGTCTGGAAGGGTACATACCAAAATGTTAATTATTTATTTAACCTGGTG 
GGAGGTGGTTTTTCTTTCCTTTTCTTTNCTTATTGAGACAGAGTTTCGCTCTCGTTGCCCCGGCTGGAAGTGGAA 
TGGCAAGCCGGCAGGAACCCAAGACAACAAGGAGGTTAAGATCAGAATAAAAGACGGGGACCGCATCCAGAAAC

SEQ ID NO:226 

>GL2_147_4_M13R Direction: anti-sense

CTTGCCATTCCACTCCAGCCTGGGCAACGAGAGCGAAACTCTGGTCTCAAAAAGAAAGGAAAAGGAAAGAAAAAC 
CACCTCCCACCAGGGTTAAATAAATAATTAACATTTTGGTATGTACCCTTCCAGGACATTTTCCTATGCATCACA
TCAATAAATATATGATAGGATATTTTACATTTGATGTATCCTGAAGATTAATTCAACAAATGTTTAGTGAGTAGG 
CNTGTTAGTAGGTGCTGGGAATTCAGCAGTGAACAAAACAAAGTCTCTGCCTCATGGAGCTCATATTATAGTAGG 
GAAGACTGTCAATAAGCAATATACACATTACTAAATGATAGATAATGCCCAGTAATGTGGCAAGTTTTATGGAAA 
AATNAATCAGGATAAAAGGATAAGAGTGATATGGGGATTCTNCTTAGATTAGGGTAGATAAAGGAANGCCTCTTA 
GNCTNAAGAGGCCACATAAACAAAATCAATTCCTTGGGAAAATCCAANCTGNCTAAACGTAGACAGGGACAAGCA
ANAGGCGGACGG 

SEQ ID NO:227 

Clone 147 - Novel (Chromosome 17) > 

CTGCTTCTCTTCTAGGATAGTTTCCCTCTAGAAATCCATGTTGCTCCTTCTCTCATTAAATGATCAGATATTTAT
GTGCCTCCTTAGCTAAGAGGCCTTCCTTATCTACCCTAACTAAGAGAATCCCCATATCACTCTTATCCTTTTATC 
CTGATTTATTTTTCCATAAAACTTGCCACATTACTGGGCATTATCTATCATTTAGTAATGTGTATATTGCTTATT 
GACAGTCTTCCCTACTATAATATGAGCTCCATGAGGCAGAGACTTTGTTTTGTTCACTGCTGAATTCCCAGCACC 
TACAACAAGCCTACTCACTAAACATTTGTTGAATTAATCTTCAGGATACATCAAATGTAAAATATCCTATCATAT 

ATTTATTGATGTGATGCATAGGAAAATGTCTGGAAGGGTACATACCAAAATGTTAATTATTTATTTAACCTGGTG
GGAGGTGGTTTTTCTTTCCTTTTCTTTTCTTTTTGAGACAGAGTTTCGCTCTCGTTGCCCAGGCTGGAGTGGAAT 
GGCAAG 

SEQ ID NO:228
Novel (chromosome 8)

>GL1_41_2_G3F1 Direction: sense 

GCTTGCAAAGGAGAGGCTGTGACTACCAAGGTCGTGTCAACAACTGAATGGCTGAAATACCCAAACTTGCCCATG 
CAAATGGGCTTGGGTCTCTCCTGGCAGCCGCCTTTGAAGGCTCTAGACTTATCTGTGAACTCCTTTTTTGAGAGG 
GTCTTTCCAACTAGTGGTTTATTCTTTGACTCTCCTCATACCTTTTTTGCCAGAGAGTGAGAGTGAGAAGGGAGG 
GCTAATGCCTGAGCTCCTGCCCTTTCTATGCAGTGAGGGTCAAGATCCTCAGCTAGTGTTTGAGGGAACTGGTGT
AACCTGGGTCTCTCATTTTCTACCATCCAAGTTGCC 

SEQ ID NO:229 

>GL1_41_2_G3R1 Direction: anti-sense
GGCAACTTGGATGGTAGAAAATGAGAGACCCAGGTTACACCAGGTTCCCTCAAACACTAGCTGAGGATCTTGACC
CTCACTGCNTAGAAAGGGCAGGAGCTCAGGCATTAGCCCTCCCTTCTCACTCTCACTCTCTGGCAAAAAAGGTAT 
GAGGAGAGTCAAAGAATAAACCACTAGTTGGAAAGACCCTCTCAAAAAAGGAGTTCACAGATAAGTCTAGAGCCT 
TCAAAGGCGGCTGCCAGGAGAGACCCAAGCCCATTTGCATGGGCAAGTTTGGGTATTTCAGCCATTCAGTTGTTG 
ACACGACTTGGTAGTCACAGCCTCTCCTTTGCAAGC 


SEQ ID NO:230 

>GL1_41_3_G3F1 Direction: sense

GCTTGCAAAGGAGAGGCTGTGACTACCAAGTCGTGTGAACAACTGAATGGCTC 

AAATGGGCTTGGGTCTCTCCTGGCAGCCGCCTTTGAAGGCTCTAGACTTATCTGTGAACTCCTTTTTTGAGAGGG 
TCTTTCCAACTAGTGGTTTATTCTTTGACTCTCCT(^TACCTTTTTTGCCAGAGAGTGAGAGTGAGAAGGGAGGG
CTAATGCCTGAGCTCCTGCCCTTTCTATGCAGTGAGGGTCAAGATCCTCAGCTAGTGTTTGAGGGAACTGGTGTA 
GCCTGGGTCTCTCATTTTCTACCATCCAAGTTGCC 
SEQ ID NO:231

>GL1_41_3_G3R1 Direction: N/A 

GGCAACTTGGATGGTAGAAAATGAGAGACCCAGGCTACACCAGNTTCCCTCAAACACTAGCTGAGGATCTTGACC 
CTCACTGCATAGAAAGGGCAGGAGCTCAGGCATTAGCCCTCCCTTCTCACTCTCACTCTCTGGCAAAAAAGGTAT 
GAGGAGAGTCAAANAATAAACCACTANTTGGANAGACCCTCTCAANAAAGGAGTTCACAGATAANTNTATAGCCT
TCAAAGGCGGCTGCCAGGAGAGAC 

SEQ ID NO:232
Novel (chromosome 8) > 
GCTTGCAAAGGAGAGGCTGTGACTACCAAGTCGTGTCAACAACTGAATGGCTGAAATACCCAAACTTGCCCATGC
AAATGGGCTTGGGTCTCTCCTGGCAGCCGCCTTTGAAGGCTCTAGACTTATCTGTGAACTCCTTTTTTGAGAGGG 
TCTTTCCAACTAGTGGTTTATTCTTTGACTCTCCTCATACCTTTTTTGCCAGAGAGTGAGAGTGAGAAGGGAGGG 
CTAATGCCTGAGCTCCTGCCCTTTCTATGCAGTGAGGGTCAAGATCCTCAGCTAGTGTTTGAGGGAACTGGTGTA 
ACCTGGGTCTCTCATTTTCTACCATCCAAGTTGCC 


SEQ ID NO:233

Novel (chromosome 9) 

>GL2-49-5M13R Direction: anti-sense 

CTGGCCGCACGCAGCTCTTTTCCTGGGGATCCCAGATGGTTGTGCGCTGGGGGCCTGGCGGAGGTCCGAGTCGGG 
TTGGATGGCTGTGTCCGTCGCCATGTTTGTGTGGTGCCTGGCCCCGTGGTCCCATGTTCTGGCAGCTTCTGGCTC
CAGCCTCAGCCTGTAGTGGGGTGTTACGTGGTAGNGAGTGCGGTAGCTGGGGGCCTGCAGTGGGGTAGGTCAGTG 
GTAGGAGTGCAGNTAGCATAGAAGTTANAAAGGTTACTAGTAGAGACTGGTCCCATACTTAGAGATGGGACACCA 
TGACCTGATGGNTACNACCCTGACAGGCCGCAGAAACCAGACACACTGGAGCCACCGACTGACACAACGGGCCNT
GGGNACGGACGTCCAATNGCGTATGANTACCANGACAGTCAGGANCCCCAAATGACCCGGAAGCAGGAAACGGGG 
CAGAAAAAANCAATAGCCAAGGAATAGTCACANGGTCGACANAACCTGNGCAGGGACCAGNAAACCAGAANGACC
AATGCACANTACATAAGAAGGGGGGGGGCACNCCACAAAAGATATGCCGGGCGACGCCAGATAAGATAAGGGACT
AGGGAAGGAGTCCGGGNAAANATGTAACGAAAGAGTATCGAACCAATGGAGGGNCGCCGGGTACCCGNGCGNAAG 
GACAAACAANAAGCAGGCATCCGGACTGGGAACCAATGGGGGGGAAAGAAGCAGCGCCCCCATGGGGANAGGGAT 
GTAAANCCACAACAACACGTGTNAAACG 


SEQ ID NO:234 

>GL2-49-4M13F Direction: N/A 

AATTGAANTTACACTCACTATAGGGCGAATTGGGCCTCTAGATGCATGCTCGAGCGGCCGCCAGTGTGATGGAAT 
ATCTGCAGAATTCGCCCTTCTCTCGGCATGGACGAGCTGTACAAGGAGGAGGCCGCAAGGCCGGTGGCAGCGGTG 

GCTCCAAGTGGTGGCCTTGGGGGTTCTTGCGGCTGCAGGGAGACCACAGGCAGGTGCCCACCTAAGAGGGACAGC
CACAGAAACCCTCTAACCTCAGCACTGCACTCCACCACGAACACCCCACGCAGGCCCTCAGGCAACCCGGCAACC 
TTCC^AACCAACGGAACACCCCACACAGGCTGAGGCTGGAGCCAGAAGCCGCCAGAACATGGGACCACAGGGCCA 
GGCACCACACAAACATGGCGACGGACACAGCCAATCCAAACCCCCGGAAAAACCGATTTCCNCGGNGGGGGAAAA 
ACCGCCCCAGTTTTCCACCCCGGGAGAGCGCCCCAAAAGGGGGGGGGNACACCCACCCACNCACCCCA7VAGGGGN 

CGCAGGGGCCAAACAGCAGAAANCCCCGAANTTCCCTTGCGGGGGGAATTCGCCGCGCCCAAAGGGGAACNAGCA
AGGAAGGCCATTGCCGGTTGCCGGGCCCAAGCACCGCCAGGAACAGCGCGCAAGGCGAC 

SEQ ID NO:235 

>GL2-49-1M13F Direction: sense
CTGCAGGGAGACCACAGGCAGGTGCCCACCTAAGAGGGACAGCCACAGAAACCCTCTAACCTCAGCACTGCACTC
CACCACGACCACCCCACGCAGGCCCTCAGCACCGCACTCCACCACGAACACCCCACACAGGCTGAGGCTGGAGCC 
AGAAGCTGCCAGAACATGGGACCACAGGGCCAGGCACCACACAAACATGGCGACGGACACAGCCATCCAACCCGA 
CTCGGACCTCCGCCAGGCCCCCAGCGCACAACCATCTGGGATCCCCAGGAAAAGAGCTGCGTGCGGCCAG 

SEQ ID NO:236

>GL2-49-1M13R Direction: anti-sense

GCTGGCCGCACGCAGCTCTTTTCCTGGGGATCCCAGATGGTTGTGCGCTGGGGGCCTGGCGGAGGTCCGAGTCGG 
GTTGGATGGCTGTGTCCGTCGCCATGTTTGTGTGGTGCCTGGCCCTGTGGTCCCATGTTCTGGCAGCTTCTGGCT 
CCAGCCTCAGCCTGTGTGGGGTGTTCGTGGTGGAGTGCGGTGCTGAGGGCCTGCGTGGGGTGGTCGTGGTGGAGT 
GCAGTGCTGAGGTTAGAGGGTTTCTGTGGCTGTCCCTCTTAGGTGGGCACCTGCCTGTGGTCTCCCTGCAG

SEQ ID NO:237 

>GL2-49-2M13F Direction: sense 

CTGCAGGGAGACCACAGGCAGGTGCCCACCTAAGAGGGACAGCCACAGAAACCCTCTAACCTCAGCACTGCACTC 
CACCACGACCACCCCACGCAGGCCCTCAGCACCGCACTCCACCACGAACACCCCACACAGGCTGAGGCTGGAGCC
AGAAGCTGCCAGAACATGGGACCACAGGGCCAGGCACCACACAAACATGGCGACGGACACAGCCATCCAACCCGA
CTCGGACCTCCGCCAGGCCCCCAGCGCACAACCATCTGGGATCCCCAGGAAAAGAGCTGCGTGCGGCCAG 

SEQ ID NO:238 
>GL2-49-2M13R Direction: N/A

CTGGCCGCACGCAGCTCTTTTCCTGGGGATCCCAGATGGTTGTGCGCTGGGGGCCTGGCGGAGGTCCGAGTCGGG 
TTGGATGGCTGTGTCCGTCGCCATGTTTGTGTGGTGCCTGGCCCTGTGGTCCCATGTTCTGGCAGCTTCTGGCTC 
CAGCCTCAGCCTGTGTGGGGTGTTCGTGGTGGAGTGCGGTGCTGAGGGCCTGCGTGGGGTGGTCGTGGTGGAGTG 
CAGTGCTGAGGTTANAGGGTTTCTGTGGCTGTCCCTCTTAGGTGGGCACCTGCCTGTGGTCTCCCTGCAGCCGCA 
AAACCCANCACACTGGAGCCACCGCTG



SEQ ID NO:239 

>GL2-49-4M13R Direction: N/A 

CTGGCCGCACGCAGCTCTTTTCCTGGGGATCCCAGATGGTTGTGCGCTGGGGGCCTGGCGGAGGTCCGAGTCGGG 
TTGGATGGCTGTGTCCGTCGCCATGTTTGTGTGGTGCCTGGCCCTGTGGTCCCATGTTCTGGCGGCTTCTGGCTC
CAGCCTCAGCCTGTGTGGGGTGTTCGTGGTGGAGTGCGGTGCTGAGGGCCTGCGTGGGGTGGTCGTGGTGGAGTG 
CAGTGCTGAGGTTAGAGGGTTTCTGTGGCTGTCCCTCTTAGGTGGGCACCTGCCTGTGGTCTCCCTGCAG 



SEQ ID NO:240 

>GL2_72_G07_G3F1 Direction: N/A

CTGCAGGGAGACCCGCAGGCAGGTGCCCACCTAAGGGGGACAGCTACAGAAACCCTCTAACTCAGCACTGNACTC 
CACCACGACACCCCACGCAGGCCCTCAGCACCGCACTCCACCACGAACAACCCACACAGGCTGAGTGCTGGAGCC 
AGAAGCTGGCAGAACATGGGACCACAGGGGCCAGGCACCACACAAACATGGCGACGGACACAGGCATNCAAACCG 
ACTCGGACCTCCGCCAGGNCCCCAGCGCACAAACATCTGGGATCCCCAGGAAAAGAGCTGCGTGCGGGCAGCGCA 

GAAACCAGCACAGTGGTTAGATATGATTAAGCGGGCGNGTCGANTAATCTGAGGTCTGATACTCACTGACTGTCG
TAAGGGNGAATTCGCGGGCGCGTAAATTCAATTCGGCCTATAGTGAGTCGTATTACAATTCACTGGGCGGCGTTT 
TACAACGTCTGTGACTGGGAAAAACCTGNGCGTTATCCAACTTAATCTGNGCTTGGAGAATTTCCCTTTTGCAGA 
CTGGGCGTAATAACGAAAAAGGGNCCGAA 



SEQ ID NO:241

>GL2_72_G08_G3F1 Direction: sense 

CTGCAGGGAGACACAGGCAGGTGCCCACCTAAGAGGGACAGCCACAGAAACCCTCTAACCTCAGCACTGCACTGC 
ACCACGACCACCCCACGCAGGCCCTCAGCACCGCACTCCACCACGAACACCCCACACAGGCTGAGGCTGGAGCCA 
GAAGCTGCCAGAACATGGGACCACAGGGCCAGGCACCACACAAACATGGCGACGGACACAGCCATCCAACCCGAC 
TCGGACCTCCGCCAGGCCCCCAGCGCACAACCATCTGGGATCCCCAGGAAAAGAGCTGCGTGCGGCCAG



SEQ ID NO:242 

>GL2_72_G09_G3F1 Direction: sense

CTGCAGGAGACCGCAGGCAGGTGCCCACCTAAGGGGGACAGCTACAGAAACCCTCTAACCTCAGCACTGCACTCC 
ACCACGAACACCCCACGCAGGCCCTCAGCACCGCACTCCACCACGAACACCCCACACAGGCTGAGGCTGGAGCCA
GAAGCTGGCAGAACATGGGACCACAGGGGCCAGGCACCACACAAACATGGCGACGGACACAGCCATCCAACCCGA 
CTCGGACCTCCGCCAGGCCCCCAGCGCACAACATCTGGGATCCCCAGGAAAAGAGCTGCGTGCGGCAG 



SEQ ID NO:243 
>GL2_95_2B10_G3F1 Direction: sense

CTGCAGGAGACACAGGCAGGTGCCCACCTAAGAGGGACAGCCACAGAAACCCTCTAACCTCAGCACTGCACTCCA 
CCACGACCACCCCACGCAGGCCCTCAGCACGGCACTCCACCACGAACACCCCACACAGGCTGAGGCTGGAGCCAG 
AAGCTGCCAGAACATGGGACCACAGGGCCAGGCACCACACAAACATGGCGACGGACACAGCCATCCAACCCGACT 
CGGACCTCCGCCAGGCCCCCAGCGCACAACCATCTGGGATCCCCAGGAAAAGAGCTGCGTGCGGCCAG 


SEQ ID NO:244

>GL2_95_2B11_G3F1 Direction: sense 

CCGCAGTGGAGACACAGGCAGGTGCCCACCTAAGAGGGACAGCCACAGAAACCCTCTAACCTCAGCACTGCACTC 
CACCACGACACCCACGCAGGCCCTCAGACCGCACTCCACCACGAAAAACCCACACAGGCGTGAGGCTGGAGCCAG 
55 AAGCTGC<^GAACATGGGANCACAGGGNCAGGCGACCACACAAACATGGCG^^ 

CATCGGACTCCGCAGGGCCCCAGCGCACAAACATACTGGGATCCCCACGGAAAAGAGCTGCGTGCTGGGCAGNAG 
CGCNAGAAACCAGCAC^GCTGGTTAGAGTACAGATAAANACGGGCGCGTCGCACGTAAANCTGAAAAAGGTCGTG 
AGTATAATCAACATGTANACTGNTT 
SEQ ID NO:245

>GL2-49-5M13F Direction: sense 

CTGCAGGGGAGACCACAGGCAGGTGCCCACCTAAGAGGGACAGCCACAGAAACCCTCTAACCTCAGCACTGCACT 
CCACCACGANCACCCCACGCAGGCCCCCAGCACCGCACTCCACCACGAAACACCCCAACACAGGCTGAGGCTGGA
GCCAGAAGCTGGCCAGAACATGGGACCAACGNGCGGGNCCCCAACGCNGGGCGGGCCCAGAACGACCGAAAAGCG
AAACCCCAAAAGAAGl^GCGCAAATTATGCAGACCAGGAAACCAGATGAAAACAAACCAAAGGCGACCAAATACC 
CCAAAANANCACCGCNCAGGAAAGCATACACGCGGGGAAACGCCCAGTTCCACCACAGGGGCCCCACAAGGAGGG 
GACCCNCCACCNACGNGCAGGGGCACCGGCCAGCAGACAGCGAGAGCGCCGCGAAGTACCGATTAGGAGGCGGCG 
AATACCCGACCACCAAAAAAGGGGGAAACACGAAAGAAAGCTGGGAAAACGGCCCGATTTTAGGACCACGGGAGT 


TTGACCGCACGGGGAGACGCCCCCCCAAAGGGACACCCCCAGAGGCACAAAAAGCAAGAAGACACCCCCCCCCAA 
AGAGACCGCTCAAGACCCGAAAGAGGTTGGAGCGGCTTCCTTAACAGAGGAAGCTTAACNCGGCACAGTTNAAAC 
GAAGGGCGAGTGGGC 

SEQ ID NO:246 
>GL2_49_1_M13R Direction: anti-sense

CTGGCCGCACGCAGCTCTTTTCCTGGGGATCCCAGATGGTTGTGCGCTGGGGGCCTGGTGGAGGTCCGAGTCGGG 
TTGGATGGCTGATGTCCGCCGCCATGATTTGATAGATGGATGCCTGGACCCTGATGAGTCCCAATGATTCTGGCA
GACATTCTGGACATCCAGCACATCAGACCTGATGATGGGGATGNTTCGATGGATG

SEQ ID NO:247

>GL2_49_2_M13R Direction: N/A

CTGGCCGCACGCAGCTCTTTTCCTGGGGATCCCAGATGGTTGTGCGCTGGGGGCCTGGCGGAGGTCCGAGTCGGG 
TTGGATGGCTGGTCGCGCAGTGGTGGTGCCTGGCCTGGGCCTGTCTGGCAGCTTCTGGCTCCAGCCTCAGCCCTG 
TGTGGGGCTGGTTCCGNTGGTGGAGTGCGGGTGCTGAAGGCCATGCGATGGAGAGTGGATCNAGGAAGGAGATGC 
AGAAGCATAGAGAGTTACGAGGGTATCTAGTGGCTGTACCCTCTATAGGTAGGGCACCATGACACTAGATGGTAC
TWCCTTGACAGAACGCAAGAAACACCAAGAAAAACCTGAGAGGCCAACCAGAAATGAC 

SEQ ID NO:248

Clone 49 - Novel (Chromosome 9) >
CTGCAGGAGACCACAGGCAGGTGCCCACCTAAGAGGGACAGCCACAGAAACCCTCTAACCTCAGCACTGCACTCC
ACCACGACCACCCCACGCAGGCCCTCAGCACCGCACTCCACCACGAACACCCCACACAGGCTGAGGCTGGAGCCA 
GAAGCTGCCAGAACATGGGACCACAGGGCCAGGCACCACACAAACATGGCGACGGACACAGCCATCCAACCCGAC 
TCGGACCTCCGCCAGGCCCCCAGCGCACAACCATCTGGGATCCCCAGGAAAAGAGCTGCGTGCGGCCAG 

SEQ ID NO:249

Novel (clone 1/226) 

>GL2_1_3_M13F Direction: anti-sense 

GAGTAGAAAAGCAAGCCCCTATACTGGGCCCTATTCAGTGGCAGCTTCTTGTTCCATAGGATTAAGGAAGACTCT 
GAGGAAATAAAAGTTGTTTGGAAAAATCCAGGTGTAGTTGCTTTGTATGTTGTGATGGGTAGAAGGGATGAAGTG
AAGTGTGAAGGCCCTCATAACCTCCATCTTGCCTCAGGACTATAGTCCTGGAACCCTNGGGGCGGAGAAAAGCGC 
CAACATTTCATNCCTGCATACATAAGGGAGAAGGAGACAGGACAACGATAAGTGAAAGAGAACAGAACAAGCAAG 
AAAAGAAGCGGANAACGGCCCCAGACAATAGTAAGGGCGAANGAATGGGCAGAAGNCTTGCAANCGTNCCCGGGG 
GCAATACAATCCCTTGAACACAAGANCCAGGGCANNCCGGGACCACGCAAGAAAACCAAGAAACACAT 


SEQ ID NO:250

>GL2_81B_E02_G3F1 Direction: N/A

GCCCGAGCCGGACTGGTCAGGATGATCACCGGACGCGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCT 
CTTCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGTGAATGAAAGTGGCGCTTTC 

TCCGGGCCCAGGGTTCCAGGACATAGTCTGAGTGCAAGATGGAGGGTATGAGGGGGCTTCACACTTCACTTCATC
CTTTTAACCATTAAAATACAAAGCGAACTACANCTGGATTTTTCCAAACAAATTTTATTTCCTCAGAGTCTTCCT 
TAATCCTATGGAACAAGAAGCTGGCACTGAATAGTGGGCCAGTATAGGGGCTTGCGTTTTCTANATCCCTTCCCC 
CAATATTAAAATATATGACTTTTAAAAAAAAAAAAAAAAGGGNGCCGGTCGCGATCTAGAACTAGTCCGGAGAAA 
CCAGACAGTTGGTTAGATAGATAAAGCGGCGCGTCGACTANTCTGAGGTCTGATACTCACTGACTGTCGTAAGGG 

CGAATTCGTTTTAAACCTGCAGGACTAGTCCCTTTATGAGGGTTAATTCTGAGCTTGGCGTAATCAT

SEQ ID NO:251

>GL2-96-1M13R Direction: sense

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACGTTCACTTCAGTCCCT
TCTACCCATCACAACATACAAGGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTT
AATCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAAT 
ATAAAAATATAGACTTTTAAAAAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATCTAGAACTAGT 


SEQ ID NO:252 

>GL2-96-3M13F Direction: sense 

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTTAA 
TCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATAT 
AAAAATATAGAC 

SEQ ID NO:253

>GL2-96-3M13R Direction: N/A 

ACTAGTTCTAGATCGCGAGCGGCGCCCTTTTTTTTTTTTTTTTTAAAAGTCTATATTTTTATATTGGGGGGAGGG 
AGTAGAAAAGCAAGCCCCTATACTGGGCCCTATTCAGTGGCAGCTTCTTGTTCCATAGGATTAAGGAAGACTCTG 
AGGAAATAAAAGTTGTTTGGAAAAATCCAGGTGTAGTTGCTTNTGNTATGTTGTGATG 


SEQ ID NO:254 

>GL2-96-4M13F Direction: anti-sense

ACTAGTTCTAGATCGCGAGCGGCGCCCTTTTTTTTTTTTAAAAGTCTATATTTTTATATTGGGGGGAGGGAGTAG
AAAAGCAAGCCCCTATACTGGGCCCTATTCAGTGGCAGCTTCTTGTTCCATAGGATTAAGGAAGACTCTGAGGAA 
ATAAAAGTTGTTTGGAAAAATCCAGGTGTAGTTGCTTTGTATGTTGTGATGGGTAGAAGGGATGAAGTGAAGTGT
GAAGGCCCCTCATACCCTCCATCTTGCCTCAGACTATGTCCTGGAACCCTGGGGCGGAGAAAGCGCCACTTTCAT 
TCCTGCTTCTTGGGATTGTTGACGGCCACGTAGTGATAGAGAACGACAAGCAAGAAGAGCGACACGCCCAGCATG 
TTGGCGAAGATGGCGAGCTGCACGTCCGTGATCATCCTGACCAGTCCGGCTCGGCCCGCAGAAACCAGCACACTG 
GAGCCACCGCTGCCACCGGCCTT 


SEQ ID NO:255 

>GL2-96-5M13F Direction: sense 

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGT 
GCCCGAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC
TACCCATCAGAACATAGAAAGCAACTACACCH'GGAl^ 

TCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATAT 
AAAAATATAGACTTTTAAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATCTAGAACTAGTCCGCAAAACCCAGCAC 
AGTGGTTAGATAGATAAAGC 


SEQ ID NO:256 

>GL2-96-5M13R Direction: N/A 

ACTAGTTCTAGATCGCGAGCGGCGCCCTTTTTTTTTTTTTTTTAAAAGTCTATATTTTTATATTGGGGGGAGGGA 
GTAGAAAAGCAAGCCCCTATACTGGGCCCTATTCAGTGGCAGCTTCTTGTTCCATAGGATTAAGGAAGACTCTGA 
GGAAATAAAAGTTGTTTGGAAAAATCCAGGTGTAGTTGCATATAAGTATGTTGTGATA

SEQ ID NO:257

>GL2_1_1_M13R Direction: sense 

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTTAA 
TCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATAT 
AAAAATATAGACTTTTAAAAAAAAAAAAAAAAAGGGCGCCGATCGCGATCTAGAACTAGT 


SEQ ID NO:258

>GL2_1_2_M13F Direction: sense

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAAGGGCCTTCACACTTCACTTCATCCCTT
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTTAA
TCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATAT 
AAAAATACTAGACTTATT 

SEQ ID NO:259

>GL2_1_2_M13R Direction: anti-sense

GAGTAGAAAAGCAAGCCCCTATACTGGGCCCTATTCAGTGGCAGCTTCTTGTTCCATAGGATTAAGGAAGACTCT 
GAGGAAATAAAAGTTGTTTGGAAAAATCCAGGTGTAGTTGCTTTGTATGTTGTGATGGGTAGAAGGGATGAAGTG 
AAGTGTGAAGGCCCTTCATACCCTCCATCTTGCCTCAGACTATGTACCTGGAACCCTGGGGCNGAGAAAGCGCCA
CNTTCATTCCTGCTTCTAGGGATCGNNAGACGGACACGATAGNTGACTAGAGAACGACAAGCAAGAA 

SEQ ID NO:260 

>GL2_226_1_1_M13F Direction: sense

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT
TCTTGCTTGTCGTTCTCTATCACCACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC 
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTAAT 
CCTCATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATAT 

AAAATATAGACTTTTAAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATCTAGAACTAGT

SEQ ID NO:261

>GL2_226_1_1_M13R Direction: anti-sense

GTAGAAAAGCAAGCCCCTATACTGGGCCCTATTCAGTGGCAGCTTCTTGTTCCATAGGATTAGGAAGACTCTGAG
GAAATAAAAGTTGXTTGGAAAAATCCAGGTGTAGTTGCTTTGTATGTTGTGATGGGTAGAAGGGATGAAGTGAAG 
TGTCAAGGCCCCTCATACCCTCCATCTTGCCTCAGACTATGTCCTGGAACCCTGGGGCGGAGAAAGCGCCNCTTT 
CATTCCTGCTTCTTGGGATTGTTGACGGCCACGTGGTGATAGAGAACGACAAGCAAGAAGAGCGANACGCCCAGC 
ATGTTGGCGAAGATGGCGAGCTGCACGTCCGTGATCATCCTGACCAGTCCAGGCTCGGC 


SEQ ID NO:262

>GL2_226_1_2_M13F Direction: sense

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTTAA 
TCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATAT 
AAAAATATAGACTTTTAAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATCTAGAACTAGT 

SEQ ID NO:263

>GL2_226_1_2_M13R Direction: anti-sense 

ACTAGTTCTAGATCGCGAGCGGCGCCCTTTTTTTTTTTTTTTTAAAAGTCTATATTTTTATATTGGGGGGAGGGA 
GTAGAAAAGCAAGCCCCTATACTGGGCCCTATTCAGTGGCAGCTTCTTGTTCCATAGGATTAAGGAAGACTCTGA 
GGAAATAAAAGTTGTTTGGAAAAATCCAGGTGTAGTTGCTTTGTATGTTGTGATGGGTAGAAGGGATGAAGTGAA 
GTGTGAAGGCCCCTCATACCCTCCATCTTGCCTCAGACTATGTCCTGGAACCCTGGGGCGGAGAAAGCGCCACTT
TCATTCCTGCTTCTTGGGATTGTTGACGGCCACGTAGTGATAGAGAACGACCAGCAAGAAGAGCGACACGCACCA 
GCATGTTGGCGAAGATGGCGAGCTGCACGTCCGTGATCATCCTGACCAGGTCCGGCATCGG 

SEQ ID NO:264 

>GL2_226_1_3_M13F Direction: sense

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC 
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTTAA 

TCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATAT
AAAAATATAGACTTTTAAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATCTAGAACTAGT 
SEQ ID NO:265

>GL2_226_1_4_M13F Direction: sense

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCGTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTCTTATTTCCTCAGAGTCTTCCTTA 
ATCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATA 
TAAAAATATAGACTTTTAAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATCTAGAACTAGT 

SEQ ID NO:266

>GL2_81B_E01_G3F1 Direction: sense

GCCGAGCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCTT 
CTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAGGTGGCGCTTTCTCCG 
CCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTCT 
ACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGGTCTTCCCTTA
ATCCTATGGAACAAGAAGCTGNCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATA 
TNAAAATATAGACTTTTAAAAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATCTAGAACTAGT 

SEQ ID NO:267

>GL2_81B_E05_G3F1 Direction: N/A

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GNCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC 
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTTAA 

TCCTATGGAACAAGAAGCTGGCACTGAATAGTGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATA
TNAAAATATAGACTTTTAAAAAAAAAAAAAAAGGGGGNCGGTCGCGATCTAGAACTAGTCCGNNAGAAACCAGCA 
CAGTGGTTAGATAGATAAAGCGGGCGGTCGACTANTCTGAGGTCTGATACTCACTGACT 

SEQ ID NO:268 

>GL2_81B_E06_G3F1 Direction: sense

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC 
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTTAA 

TCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATAT
AAAAATATAGACTTTTAAAAAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATCTAGAACTAGTCCGNAGAACCCAG 
CACAGTGGTTAGATAGATAAAGCGGCCGCTCGACTAGTCTGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAA 
TTCGTTTAAACCTGCAGGACTAGTCCCTTTATGAGGGTTAAATTCTGAGCTTGGCGTAATCATGGTCAT 

SEQ ID NO:269

>GL2_1_3_M13R Direction: sense 

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC 
TACCCATCACAACATACAAAGCAACTACACACTGGATTTTTCCAAAC^CTTTTATTTCCTCAGAGTCTTCCTTA
ATCCTATGGAACJVAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCATTTTCTACTCCCTCCCACCCAA 
TATAAAAATATAGACTTTTTAAACAAACACAACAACACAACAAAAAGGGCGCCAGCTCGGCGATCTAGAACTAGT 
CCGGCAAGAACCCCAGNAACAGGGGTTAGATA 

SEQ ID NO:270

>GL2_81B_E04_G3F1 Direction: N/A 

GCCGAGCCGGACTGGTCAGGATGATCACGGACGCGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC 
GCCCCAGGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTT 
CTANCCATCACAACATACAAAGCAACTACACCTGGA

ATCCTATGGAACAAGAAGCTGGCACTGAATAGGGCCCAGTATANGGGCTTGCGTTTTCTACTCCCTCCCCCCAAT
ATAAAAATATAGACTTTTAAAAAAAAAAAATNAAAGGGNGCCGCTCGCGATACTAGAACTAGT 
SEQ ID NO:271

>GL2_226_1_3_M13R Direction: anti-sense

CTAGTTCTAGATCGCGAGCGGCGCCCTTTTTTTTTTTTTTTTAAAAGTCTATATTTTTATATTGGGGGGAGGGAG 
TAGAAAAGCAAGCCCCTATACTGGGCCCTATTCAGTGGCAGCTTCTTGTTCCATAGGATTAAGGAAGACTCTGAG 
GAAATAAAAGTTGTTTGGAAAAATCCAGGTGTAGTTGCTTTGTATGTTGTGATGGGTAGAAGGGATGAAGTGAAG
TGTGAAGGCCCCTCATACCCTCCATCTTGCCTCAGACTATGTCCTGGAACCCTGGGGCGGAGAAAGCGCCACTGT 
TCATTCCTGCTNTGTTGGGATTGTTCGACGGNCACGTAGATGATAGAGAACGACAAGCAAGAAGAGCGAANACGC
CCAGCATGTAAGGCGAAGATGGCGAGCTGCACGTCCGTGATCATCCTGACCAGTCCGGCTCGGA 

SEQ ID NO:272

>GL2_226_1_4_M13R Direction: anti-sense

ACTAGTTCTAGATCGCGAGCGGCGCCCTTTTTTTTTTTTTTTTAAAAGTCTATATTTTTATATTGGGGGGAGGGA 
GTAGAAAAGCAAGCCCCTATACTGGGCCCTATTCAGTGGCAGCTTCTTGTTCCATAGGATTAAGGAAGACTCTGA 
GGAAATAAAAGTTGTTTGGAAAAATCCAGGTGTAGTTGCTTTGTATGTTGTGATGGGTAGAAGGGATGAAGTGAA 

GTGTGAAGGCCCCTCATACCCTCCATCTTGCCTCAGACTATGTCCTGGAACCCTGGGGCGGAGAAAGCGCCACTT
TCATTCCTGCTTCTTGGGATTGTTGACGGCCACGTAGTGATAGCGAACGACAAGCAAGAAGAGCGACACGCCCAG 
CATGTTGGCGACGATGGCGAGCTGCACGTCCGTGATCATCCTGACCAGTCCGGCTCGGCCCGAAGAACCCAGACA 
ACACTGGAGCCACCGTGCCACCGGCCTTGGCGGCTACCTCCTTGTACAAGCTCGGTCCATGCCGAGAGAAGGGCG 
AAATACTGCAGATATCCATCACAACTGGCGGGCCGCATCCGAGCATGCATCTAGAAGGGCCAAATACGCCATATA 

GGNGAGTCGGAATAACAATTCACCTGGCCGGCGATCCAAACGTCGTGACTGGAAAAAACCCGGGGGATNACCCAA
AAAAGTTAAAATCGGCCNGAAGGACANAACCCCACGATTAGGCAAANAGNGCGAAAAATAACNAAAA 

SEQ ID NO:273 

>GL2_81B_E03_G3F1 Direction: sense 

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAGGTGGCGCTTTCTCC 
GCCCCAGGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTT 
CTACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTTA 
ATCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATA 

TAAAAATATAGACTTTTAAAAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATCTAGAACTAGT

SEQ ID NO:274 

Clone 1/226 - Novel (Clone 1/226) >

GCCGAGCCGGACTGGTCAGGATGATCACGGACGTGCAGCTCGCCATCTTCGCCAACATGCTGGGCGTGTCGCTCT 
TCTTGCTTGTCGTTCTCTATCACTACGTGGCCGTCAACAATCCCAAGAAGCAGGAATGAAAGTGGCGCTTTCTCC
GCCCCAGGGTTCCAGGACATAGTCTGAGGCAAGATGGAGGGTATGAGGGGCCTTCACACTTCACTTCATCCCTTC 
TACCCATCACAACATACAAAGCAACTACACCTGGATTTTTCCAAACAACTTTTATTTCCTCAGAGTCTTCCTTAA 
TCCTATGGAACAAGAAGCTGCCACTGAATAGGGCCCAGTATAGGGGCTTGCTTTTCTACTCCCTCCCCCCAATAT 
AAAAATATAGACTTTTAAAAAAAAAAAAAAAAAAAAA 


SEQ ID NO:275

Novel (FLJ10688)

>GL2_51_C08_G3F1 Direction: sense

CACCAGCACCCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAGA 
AGCCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAATGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGATC
GAAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAAT 
GGAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGC 
AAATACTACTTTG 

SEQ ID NO:276

>GL2_51_C10_G3F1 Direction: sense 

CACCCAGCACCCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAG 
AAGCCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAATGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGAT 
CGAAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAA 
TGGAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGC
AAATACTACTTTG 
SEQ ID NO:277

>GL2_51_C11_G3F1 Direction: N/A 

CACCCAGCACCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAGA 
AGCCACACTTGAAGTTAGCACGCGTTCAGGGGTCAAAATGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGAT 
CGAAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAA
TGGAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGC 
AAATACTACGTTTG 

SEQ ID NO:278 

>GL2_59_E07_G3F1 Direction: sense

CACCAGCACCTTACAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAGAAA 
CCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAACTGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGATCG 
AAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAATG 
GAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGCAA 

ATACTACTTTG

SEQ ID NO:279 

>GL2_59_E08_G3F1 Direction: sense

CACCCAGCACCCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAG 
AAACCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAATGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGAT
CGAAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAA 
TGGAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGC 
AAATACTACTTTG 

SEQ ID NO:280

>GL2_59_E09_G3F1 Direction: sense 

CACCAGCACCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAGAA 
GCCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAATGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGATCG 
AAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAATG
GAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGCAA
ATACTACTTTG 

SEQ ID NO:281

>GL2_59_E10_G3F1 Direction: sense 
CACCCAGCACCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAGA
AGCCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAATGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGATC 
GAAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAAT 
GGAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGCA 
AATACTACTTTG 


SEQ ID NO:282 

>GL2_59_E11_G3F1 Direction: sense 

CACCAGCACCCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAGA 
AACCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAATGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGATC 
GAAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAAT
GGAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGCA 
AATACTACTTTG 

SEQ ID NO:283 

>GL2_59_E12_G3F1 Direction: sense

(^CC^GCACCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAGAA 
GCCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAATGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGATCG 
AAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAATG 
GAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGCAA 

ATACTACTTTG

SEQ ID NO:284

>GL2_51_C09_G3F1 Direction: N/A

CACCAGCACCTTACCAGAAGCTCCACAACCACAGGCGTCTGGCCCCCCAAGAACAAACAGGGCCCTTTGTGGTCG 
TNCCCCACAAAAGGGGGCCCCCTTATAAACACCCATAAAATTTTTCACCCCTTTTNTCCCCCTTGGGGTGTCCCC 
CCCCCCCTTTTTTCCCCAAAAAAGGGGNCCCAAAACCAAAGGGGCCCCCCCCCCTTTAAAAACCCTAAAAACCCC

SEQ ID NO:285

Novel (FLJ10688)> 

CACCCAGCACCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAGCTGCCAGCACATCTCTGCCTCAGA
AGCCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAATGGCATAGTACTGTCATGGAGTGTCCTGGAGGTGGATC 
GAAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTACCATGAGGAACCCAGTGCCACTGTGCCCTCACAAT 
GGAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGCCCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGCA 
AATACTACTTTG 


SEQ ID NO:286 

>gi|8922591|ref|NM_018179.1| Homo sapiens hypothetical protein FLJ10688
(FLJ10688), mRNA

ATCTAAACAAGAAAGTAGTGAGAGTTTGCCAAAAGAAGCCTTTCTGGTCCTCTCTGATGAAGAGGATATT

TCGGGTGAAAAAGATGAGTCTGAAGTTATATCGCAAAATGAAACGTGCTCTCCAGCAGAAGTAGAAAGTA
ATGAAAAGGACAACAGACCTGAGGAAGAAGAGCAAGTAATACATGAAGATGATGAAAGACCTTCTGAGAA 
AAATGAATTTTCTAGACGAAAACGTTCTAAATCAGAAGACATGGACAATGTACAGTCTAAACGTCGTCGA 
TATATGGAAGAAGAATATGAGGCAGAATTTCAAGTAAAGATTACAGCCAAAGGAGACATTAACCAGAAAC 
TTCAAAAGGTTATACAGTGGTTGCTGGAAGAAAAATTGTGTGCGCTGCAGTGTGCTGTATTTGATAAGAC 

TTTGGCAGAATTGAAAACACGAGTGGAAAAGATTGAATGTAACAAGAGGCATAAAACAGTTCTCACTGAA
CTACAGGCCAAGATAGCCAGGTTAACCAAACGCTTTGAAGCAGCCAAAGAAGATCTTAAGAAAAGACATG
AACATCCACCCAACCCACCAGTATCACCAGGAAAAACTGTAAATGATGTCAACAGCAATAATAACATGTC 
TTACAGAAATGCAGGCACAGTGAGACAGATGCTGGAGTCCAAAAGAAATGTAAGCGAGAGTGCACCACCA 
TCCTTTCAAACTCCTGTGAATACAGTATCTTCAACCAATCTTGTCACTCCTCCAGCAGTTGTCAGTAGTC 

AACCTAAATTGCAGACTCCAGTGACTTCGGGTTCCCTCACAGCAACGTCAGTTCTTCCTGCACCCAATAC
AGCTACTGTAGTTGCTACTACTCAGGTGCCTAGTGGAAATCCCCAGCCTACAATCTCTTTACAGCCTTTG 
CCAGTGATTTTGCATGTACCTGTTGCAGTATCCTCCCAGCCTCAGCTTCTACAGAGCCATCCAGGGACTT 
TGGTGACTAATCAACCATCTGGCAATGTTGAATTCATTTCTGTGCAAAGCCCACCTACAGTGAGTGGTCT 
TACCAAAAATCCAGTATCCTTGCCATCCTTGCCAAATCCCACTAAACCAAACAACGTTCCTTCTGTGCCC 

AGTCCTAGTATTCAAAGGAACCCTACTGCCAGTGCTGCACCATTGGGAACAACACTTGCTGTGCAGGCTG
TTCCAACAGCAC^CTCTATTGTACAAGCCAGAAGGACTTCTTTACCCACAGTGGGCCCATCAGGA^ 
TAGTCCATCAACTAATCGAGGTCCTATACAGATGAAAATTCCAATTTCTGCATTTAGTACTTCGTCTGCT 
GCAGAACAGAACAGCAATACCACCCCAAGAATTGAAAACCAGACAAACAAAACAATAGATGCTTCTGTCA 
GTAAGAAAGCAGCTGATAGCACATCACAGTGTGGAAAAGCCACTGGCAGTGATTCAAGTGGTGTCATTGA 

TCTCACAATGGATGATGAAGAGAGTGGAGCTTCACAAGACCCCAAAAAACTAAATCACACTCCTGTATCA
ACCATGAGTTCTTCTCAGCCTGTGTCACGACCATTGCAACCCATACAACCAGCACCGCCTCTTCAACCAT 
CTGGGGTGCCAACAAGTGGACCATCTCAGACCACCATACACTTACTACCTACAGCTCCAACTACCGTGAA 
TGTAACACATCGTCCAGTAACTCAGGTGACCACAAGACTCCCTGTACCAAGAGCTCCTGCAAACCACCAG 
GTGGTTTATACAACTCTTCCTGCACCACCAGCTCAGGCTCCCTTGCGAGGAACTGTTATGCAGGCTCCTG 

CTGTTCGGCAGGTCAATCCCCAAAATAGTGTTACAGTTCGAGTGCCTCAAACAACCACATATGTTGTAAA
CAATGGACTAACCCTGGGATCAACAGGACCTCAGCTCACAGTGCATCACCGACCACCACAAGTGCATACT 
GAGCCCCCACGCCCCGTGCACCCAGCACCCTTACCAGAAGCTCCACAACCACAGCGTCTGCCCCCAGAAG 
CTGCCAGCACATCTCTGCCTCAGAAGCCACACTTGAAGTTAGCACGCGTTCAGAGTCAAAATGGCATAGT 
ACTGTCATGGGGTGTCCTGGAGGTGGATCGAAGCTGTGCCACTGTTGATAGCTACCATCTCTATGCTTAC 

CATGAGGAACCCAGTGCCACTGTGCCCTCACAATGGAAAAAGATTGGGGAAGTCAAGGCACTTCCCTTGC
CCATGGCATGTACTCTCACCCAGTTTGTATCTGGTAGCAAATACTACTTTGCAGTACGAGCCAAGGATAT 
TTATGGACGTTTTGGGCCTTTCTGTGATCCTCAGTCAACAGATGTGATCTCTTCTACCCAGAGCAGTTAA 
ACCTTGGAGCCTTTATATTTTCCTCTTTTAAAATTTCCACCTTTTGGTCTTGTTTTTAATCTTGTGCATG 
ATACCCCATGTAAAATCCACCTTGTGCAAGATTTCTTGGACAGATGTGTGTATACACTACATTTGTTTAT 

AACCAGAAGCAAAATAAACTCAGCCCACAAAGCT

SEQ ID NO:287

>gi|8922592|ref|NP_060649.1| hypothetical protein FLJ10688; likely ortholog
of mouse activating transcription factor 7 interacting protein [Homo 
sapiens] 

MDNVQSKRRRYMEEEYEAEFQVKITAKGDINQKLQKV

NKRHKTVLTELQAKIARLTKRFEAAKEDLKKRHEHPPNPPVSPGKTVNDVNSNNNMSYRNAGTVRQMLES
KRNVSESAPPSFQTPWTVSSTiTCjVTPPAW 

PQPTISLQPLPVILHVPVAVSSQPQliLQSHPGTLVTNQPSGl^EFISVQSPPTVSGLTKNPVSLPSLPNP 

TKPNNVPSVPSPSIQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLySPSTNRGPIQMKI
PISAFSTSSAAEQNSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSSGVIDLTMDDEESGASQD

PKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPT^ 

PVPRAPAtraQVVTTTLPAPPAQAPLRGTVMQAPA 

VlIHRPPQVTITEPPRPvlIPAPLPEAPQPQRLPPEAASTSL^^ 

TVTDSYHLYAYHEEPSATVTSQWKKIGEfVl^ 
DVISSTQSS

SEQ ID NO:288

Novel (KIAA0217) 

>G3__4__03_PCR_G3F1 Direction: N/A 

TCTTGCGGCAGGGTCAAAACAACATCATGGTACAACTTCTGGTAACATGTCAGTGGGCAACAAAGTGTATGTTTT
CATGTAGTTCTATTCACAATTGGTTCCTTTGAATTTATATTACATAATACCACTTGAAACTGGAGTGAGAAGCTG 
GTTACTTGCTTGCAGTGCATCAGAACCAAGGTTTCTGGGGTGGCCCCTGCAGTTCACTCCGCCCAGGAGATCAAA 
AAGGATGCGCTCTCTACATTCTTAAATGCCACAGTCCCCTACCAACGGCTTAGTTGAAGCACAGCTCCCACATCA 
GCACTTATTCCCTGGCGCGGCCAGCACTGCCTCTACGAGGCCACACCTGCCTCTGCCAAGGCCACTGGCCCTCAT 

TCATCTTCGGGGACGTTCTAGAGATATGCACTTACAG

SEQ ID NO:289 

>GL3_4.03_A_M13F Direction: anti-sense

CTGTAAGTGCATATCTCTAGAACGTCCCCGCAGATGAATGAGGGCCAGTGGCCTTGGCAGAGGCAGGTGTGGCCT 
CGTAGAGGCAGTGCTGGCCGCGCCAGGGCATCAGTGCTGATGTGGGAGCTGTGCTTCCACCTAAGCCGTTGGTAG
GGGACTGTGGCATTTAAGAATGTAGAGAGCGCATCCTTTTTGATCTCCTGGGCGGAGTGAACCTGCAGGGGCCAC 
CCCAGAAACCTTGGTTCTGATGCACTGCAAGCAAGTAACCAGCTTCTCACTCCAGTTTCAAGTGGCTATTATGTA 
ATATAAATTCAAAGCAGATTGTGAATAGAACCTACATGAAAACATACACTTTGTTGCCCACTGACATGTTACCAG
AAGTTGTACCATGATGTTGTTTTGACCCCTG 


SEQ ID NO:290

>GL3_4.03_A_M13R Direction: sense

CAGGGGTCAAAACAACATCATGGTACAACTTCTGGTAACATGTCAGTGGGCAACAAAGTGTATGTTTTCATGTAG 
GTTCTATTCACAATGTGCTTTGAATTTATATTACATAATAGCCACTTGAAACTGGAGTGAGAAGCTGGTTACTTG 
CTTGCAGTGCATCAGAACCAAGGTTTCTGGGGTGGCCCCTGCAGGTTCACTCCGCCCAGGAGATCAAAAAGGATG
CGCTCTCTACATTCTTAAATGCCACAGTCCCCTACCAACGGCTTAGGTGGAAGCACAGCTCCCACATCAGCACTG 
ATGCCCTGGCGCGGCCAGCACTGCCTCTACGAGGCCACACCTGCCTCTGCCAAGGCCACTGGCCCTCATTCATCT 
GCGGGGACGTTCTAGAGATATGCACTTACAG 

SEQ ID NO:291

>GL3_4.03_B_M13F Direction: sense

CAGGGGTCAAAACAACATCATGGTACAACTTCTGGTAACATGTCAGTGGGCAACAAAGTGTATGTTTTCATGTAG 
GTTCTATTCACAATGTGCTTTGAATTTATATTACATAATAGCCACTTGAAACTGGAGTGAGAAGCTGGTTACTTG 
CTTGCAGTGCATCAGAACCAAGGTTTCTGGGGTGGCCCCTGCAGGTTCACTCCGCCCAGGAGATCAAAAAGGATG 
CGCTCTCTACATTCTTAAATGCCACAGTCCCCTACCAACGGCTTAGGTGGAAGCACAGCTCCCACATCAGCACTG
ATGCCCTGGCGCGGCCAGCACTGCCTCTACGAGGCCACACCTGCCTCTGCCAAGGCCACTGGCCCTCATTCATCT 
GCGGGGACGTTCTAGAGATATGCACTTACAG 

SEQ ID NO:292 
>GL3_4.03_B_M13R Direction: anti-sense

CTGTAAGTGCATATCTCTAGAACGTCCCCGCAGATGAATGAGGGCCAGTGGCCTTGGCAGAGGCAGGTGTGGCCT 
CGTAGAGGCAGTGCTGGCCGCGCCAGGGCATCAGTGCTGATGTGGGAGCTGTGCTTCCACCTAAGCCGTTGGTAG 
GGGACTGTGGCATTTAAGAATGTAGAGAGCGCATCCTTTTTGATCTCCTGGGCGGAGTGAACCTGCAGGGGCCAC 
CCCAGAAACCTTGGTTCTGATGCACTGCAAGCAAGTAACCAGCTTCTCACTCCAGTTTCAAGT 
ATATAAATTCAAAGCACATTGTGAATAGAACCTACATGAAAACATACACTTTGTTGCCCACTGACATGTTACCAG
AAGTTGTACCATGATGTTGTTTTGACCCCTG 

SEQ ID NO:293 
>GL3_4.03_C_M13F Direction: anti-sense

CTGTAAGTGCATATCTCTAGAACGTCCCCGCAGATGAATGAGGGCCAGTGGCCTTGGCAGAGGCAGGTGTGGCCT 
CGTAGAGGCAGTGCTGGCCGCGCCAGGGCATCAGTGCTGATGTGGGAGCTGTGCTTCCACCTAAGCCGTTGGTAG 
GGGACTGTGGCATTTAAGAATGTAGAGAGCGCATCCTTTTTGATCTCCTGGGCGGAGTGAACCTGCAGGGGCCAC 
CCCAGAAACCTTGGTTCTGATGCACTGCAAGCAAGTAACCAGCTTCTCACTCCAGTTTCAAGTGGCTATTATGTA 
ATATAAATTCAAAGCACATTGTGAATAGAACCTACATGAAAACATACACTTTGTTGCCCACTGACATGTTACCAG
AAGTTGTACCATGATGTTGTTTTGACCCCTG

SEQ ID NO:294 

>GL3_4.03_C_M13R Direction: sense

CAGGGGTCAAAACAACATCATGGTACAACTTCTGGTAACATGTCAGTGGGCAACAAAGTGTATGTTTTCATGTAG
GTTCTATTCACAATGTGCTTTGAATTTATATTACATAATAGCCACTTGAAACTGGAGTGAGAAGCTGGTTACTTG 
CTTGCAGTGCATCAGAACCAAGGTTTCTGGGGTGGCCCCTGCAGGTTCACTCCGCCCAGGAGATCAAAAAGGATG 
CGCTCTCTACATTCTTAAATGCCACAGTCCCCTACCAACGGCTTAGGTGGAAGCACAGCTCCCACATCAGCACTG 
ATGCCCTGGCGCGGCCAGCACTGCCTCTACGAGGCCACACCTGCCTCTGCCAAGGCCACTGGCCCTCATTCATCT 

GCGGGGACGTTCTAGAGATATGCACTTACAG

SEQ ID NO:295 

>GL3_4.03_D_M13F Direction: anti-sense

CTGTAAGTGCATATCTCTAGAACGTCCCCGCAGATGAATGAGGGCCAGTGGCCTTGGCAGAGGCAGGTGTGGCCT 
CGTAGAGGCAGTGCTGGCCGCGCCAGGGCATCAGTGCTGATGTGGGAGCTGTGCTTCCACCTAAGCCGTTGGTAG
GGGACTGTGGCATTTAAGAATGTAGAGAGCGCATCCTTTTTGATCTCCTGGGCGGAGTGAACCTGCAGGGGCCAC 
CCCAGAAACCTTGGTTCTGATGCACTGCAAGCAAGTAACCAGCTTCTCACTCCAGTTTCAAGTGGCTATTGTGTA 
ATATAAATTCAAAGCACATTGTGAATAGAACCTACATGAAAACATACACTTTGTTGCCCACTGACATGTTACCAG 
AAGTTGTACCATGATGTTGTTTTGACCCCTG 


SEQ ID NO:296 

>GL3_4.03_D_M13R Direction: sense

CAGGGGTCAAAACAACATCATGGTACAACTTCTGGTAACATGTCAGTGGGCAACAAAGTGTAT 
GTTCTATTCACAATGTGCTTTGAATTTATATTACACAATAGCCACTTGAAACTGGAGTGAGAAGCTGGTTACTTG 
CTTGCAGTGCATCAGAACCAAGGTTTCTGGGGTGGCCCCTGCAGGTTCACTCCGCCCAGGAGATCAAAAAGGATG
CGCTCTCTACATTCTTAAATGCCACAGTCCCCTACCAACGGCTTAGGTGGAAGCACAGCTCCCACATCAGCACTG 
ATGCCCTGGCGCGGCCAGCACTGCCTCTACGAGGCCACACCTGCCTCTGCCAAGGCCACTGGCCCTCATTCATCT 
GCGGGGACGTTCTAGAGATATGCACTTACAG 

SEQ ID NO:297

>gi|22051279|ref|XM_040265.5| Homo sapiens KIAA0217 protein (KIAA0217), mRNA

AGCATCCTGAGGTTAGCATTACACTTCTCCCAGTGGAGCCCATGACTTCTGATCAGGACGCTAAGGTTGT 
GGCTGAACCGCAGACGCAGAGAGTCCAGGAGGGCAAGGACAGCGCTCATCTGATGAATGGTCCTATATCT 

CAAACCACTTCTCAGACAAGTTCCATCCCACCTTTGAGTCAGGTACCAGCAACTAAGGTTTCAGAGCTGA
ACCCTAATGCAGAAGTGTGGGGGGCTCCTGTGTTACATCTGGAAGCAAGCAGTGCTGCTGACGGTGTGAG
TGCTGCATGGGAGGAGGTGGCTGGCCACCACGCAGACCGTGGCCCGCAGGGATCGGATGCCAATGGTGAT 
GGTGACCAGGGCCATGAGAATGCCGCATTGCCAGACCCGCAGGAGTCGGACCCAGCAGACATGAACGCTC 
TCGCTCTGGGTCCCTCAGAATATGACTCTCTGCCTGAAAATAGCGAGACAGGAGGAAATGAGTCTCAACC 

AGACAGCCAGGAAGACCCCCGAGAAGTACTTAAAAAAACATTGGAATTCTGCTTATCTAGGGAGAACCTT
GCTAGTGACATGTATCTTATATCACAGATGGATAGTGACCAGTATGTGCCAATCACAACGGTGGCTAACC 
TCGACCACATCAAGAAGCTCAGCACTGATGTGGACTTGATTGTGGAAGTGCTAAGATCTTTACCTTTAGT 
CCAAGTGGATGAAAAGGGAGAAAAAGTAAGGCCAAATCAAAATCGCTGCATAGTAATATTGCGTGAAATA 
TCTGAATCTACCCCCGTGGAAGTAATAGAAGCACTATTTAAAGGAGATAATTTACCAAAATTTATAAACT 

GTGAATTTGCATATAATGATAATTGGTTTATTACATTTGAAACAGAAGCTGATGCACAACAGGCTTACAA
ATACCTTCGAGAAGAAGTCAAAACTTTTCAAGGAAAACCAATTAAGGCACGGATAAAAGCAAAGGCAATA 
GCTATAAACACATTTTTGCCAAAGAATGGATTTAGACCCCTGGACGTGAGCCTGTATGCCCAGCAGCGCT 
ACGCGACGTCGTTCTACTTCCCTCCCATGTACAGCCCCCAGCAGCAGTTCCCCCTGTACAGCCTGATCAC
TCCCCAGACGTGGTCAGCAACGCACAGCTATCTTGACCCACCCTTGGTAACTCCATTTCCAAATACTGGA 

TTTATAAATGGGTTTACGTCTCCAGCGTTCAAGCCTGCGGCGTCTCCTCTGACTTCTCTCAGACAGTATC
CTCCTCGAAGCAGGAATCCTAGTAAATCTCATCTGCGGCATGCGATTCCTAGTGCAGAGAGGGGACCTGG
GTTATTAGAAAGTCCTTCAATATTTAACTTCACTGCAGATCGATTAATTAATGGTGTCCGGAGTCCACAA
ACAAGGCAAGCAGGTCAAACTAGAACACGGATTCAAAACCCTTCAGCATATGCCAAGAGAGAGGCTGGGC 
CTGGGCGTGTGGAGCCAGGCAGTCTCGAATCCTCTCCTGGTTTAGGGAGGGGAAGGAAGAATTCCTTTGG 
CTACCGGAAGAAAAGGGAGGAGAAGTTTACAAGCAGCCAGACACAGTCTCCAACGCCACCAAAGCCTCCG
TCGCCAAGCTTCGAGCTGGGGCTGTCCAGCTTCCCTCCATTACCTGGAGCTGCCGGCAATTTGAAGACAG 
AGGACTTGTTTGAAAACAGGCTATCTAGCTTGATAATAGGACCATCCAAAGAAAGGACCCTCAGTGCAGA 
CGCAAGCGTGAACACCCTTCCTGTAGTGGTCTCCAGAGAGCCCTCGGTGCCGGCTTCTTGTGCTGTATCA 
GCAACGTACGAGCGATCCCCCTCCCCAGCTCATTTACCCGATGATCCCAAGGTGGCGGAGAAACAGAGGG 

AAACCCACAGTGTGGACAGACTTCCTTCCGCCCTCACTGCGACCGCGTGTAAATCGGTGCAGGTGAACGG
AGCCGCCACGGAATTGCGAAAGCCCAGCTACGCAGAGATTTGTCAGAGAACGAGTAAAGAGCCTCCTTCT 
TCCCCATTGCAACCCCAAAAAGAACAAAAGCCAAACACTGTTGGTTGTGGGAAGGAGGAAAAGAAGCTGG
CAGAGCCCGCAGAGAGATACCGGGAGCCCCCAGCCCTCAAGTCCACACCTGGAGCCCCCAGAGACCAGAG 
GCGGCCGGCGGGGGGCCGGCCCTCGCCCTCGGCCATGGGGAAGCGTCTCAGCCGAGAGCAGAGCACTCCC 

CCCAAGTCTCCTCAGTGAAAACCGTACGTCTGGGAGGGGTCGCAGAGCGCTGTGTTAACCACAAACGAGA
CACTCTCCCACTCAGTGCGAGGGCGAGCCGCTGGTTAGGAGCTTGCAGTGTCTGAGGCCTGTGGGATCCT 
CAAGTTGGTTTTCTTCTGTGAGTTGGATTCTCCCCCTCTTGAAAAAAAATCGATTTTTCAGGATTTAATT 
AATACAAACCTTATTTTAGGTTGGTGCTTAACTGGAGGTGATGCATAAGTCTGATTTTTTTTTCCAAGAT 
AGAAAAAGCATTTATCCTAACAAATTGGTATTTTTTATTAAGCCTCCATGTGGCTCTGAATGCAAGCTAT

ATATAGTGAGTTTTTCTAAATTAAGGGAACTCTGCTTTTTTTTTTTTTTTTTAAGTAACTGGTCTGTAAG
TGCATATCTCTAGAACGTCCCCGCAGATGAATGAGGGCCAGTGGCCTTGGCAGAGGCAGGTGTGGCCTCG 
TAGAGGCAGTGCTGGCCGCGCCAGGGCATCAGTGCTGATGTGGGAGCTGTGCTTCCACCTAAGCCGTTGG 
TAGGGGACTGTGGCATTTAAGAATGTAGAGAGCGCATCCTTTTTGATCTCCTGGGCGGAGTGAACCTGCA 
GGGGCCACCCCAGAAACCTTGGTTCTGATGCACTGCAAGCAAGTAACCAGCTTCTCACTCCAGTTTCAAG 

TGGCTATTATGTAATATAAATTCAAAGCACATTGTGAATAGAACCTACATGAAAACATACACTTTGTTGC
CCACTGACATGTTACCAGAAGTTGTACCATGATGTTGTTTTGACCCCTGTGAGCTGATGGCCCCGGCCCT 
GCTCTGTGCACATTTCTGTCCGTGTTCCCCAGCACTCTGGTTGGAGAGAGTCCACATCTTCAGCTCCGTG 
TGGACATCTCCCTGTACCTCTGCATCAGCACATGGATTTAAGAGTTATGTAATCGTGAGAGAATGGTGTT 
TGTGGTTTTTCCCCCTCTTTGGCTGGTGGAGGATAAAGTTCCTGCTCTTTTACCTCCAAGACGAGGGCCT 

CATTGATTCACTTCCAGAAGTGCTGCACTTCTGAAGAACAAGGATGCACTAAAGTTAGCAAGTTTATAAT
AAAGTTAAATATAAATTATTTTGTTTTAAAATGCCTCAAA 

AATAAGAATATTTCCTGCTAAATGTAACCATACACTTTATTCCACAAAATGTTATTTAACAAGACTGAGG 
GTTTTTTTTAAGAAAAAATTATTTCCATCCAATATTTAAAGACTTGAATTTTATTTAAACTTGAAAATGA 
CTTTGCCTTAACTTTTGTATAAGACAGCTTAGAGTCCATGGAGCCCGGCCCTGGGTTGGCGTGAGTGGGT 

CAGAGTTACTCAGTTACTGCGTGGATCTCCTGTCGCTAGTTTTACTGAGTAAGCATACTGTAGTACAAGA
GCTAGTAGTAGTTTTTGTAATATACCTTAAAGATCTTCAACAGTTGATCTTTTTTCAGAATGTTGGAAAA 
TCCTGTAAATGCAAATAGTCAATACTGTATTAAATACGTGCACTTGGAGTGTGCTTCGCTTGTACAGTTG 
TAAATAATCAGAACATATGAAAAAGGTACCCTACAGAGAAAATTCTGATACAGATTATTGATATATTATA 
AATGTTGCTGTTGAGCGGGATGTAGATAAACTAAATGTTGTGGTTTGAATATTATTTTGATTTGTTGAGA 

TTTTCTTTTTTCTCTTACATCGGTGTGTTGAACTGATTCTGCCTCTTTGCTGCAAAAGGGAATTGGAAAG
TCTTATTAAAAGCCTCCAGATGTTTTCATACTCTTTTAAAATGTATGTAAATGCATACTAATCATATCTA 
ATGTGAAAGAGTTTTAAAGTATATAGAGAGCAAAAACTGGCAGGATCGTAAGTGAAGGTGACTAGTAATC 
TAATTTAAATCACCTGCAGCTAAGCATGATTGACCCTGCCAGAGGAAAACATGCCTATTTGACCATTTCC 
TTTAAAGCAGTTGCCATTATTCAAATACAGAGAAATAGCCACAGGGCTAGTGTTTTTCAAATGCATTTTA 

AAGAACATGGGGATTTTTTTTTGTAGTTGTCAGTTCACTGACCAAAAAAAAAAAAAAAATCAGAAATAAT
TGATCTGTGAAACCCAAACTCTCAATACTCAGAAAGCTGGGAGGCAACCTCGAGGCCTGGGCCTACGAGC 
TGCATCTTCGCTACGGAAGGGCCAGGGCGCCATCAGCCATTCCCAAAACACAAGGCCTGCCCGTCCGCCA 
GTGAGTCCTTGGTTTTTAATAATGAGAAGTCCTTTCCCCCAAGGTGTGAGCATTGCAGCGCAGTGTGTGT 
GTGTGGTTAGAGCCAGCTTAGTCCTTCACTTTGTCGACCGAAGTGGGAGCTCAACAGCTGCATGAGGAGG 

GCAGCGCGTGCATTAGCCAGTCGCCACTGGAGGGCTCTGCTGCCCTCCGGTCAATACACTGTAGTTACTG
CCTAGCCAGCAGCAGTCTTCTGCATCAAGAACTGAAACGTTGCTCGGAGGTGATTTTTATAGCATCCTTT 
TTAATTAAAGGTGAAATAGAGATTGCTATATAATGTCTGAAAAAACCTGATACTACTTCAAGAGTTTCTG 
CTCAGAAGAAAATGAGAGTTATCATAATAGGAAGCTGTGGCGGTCCATGCCAACTGTGCTGTGTCACATA 
CAGCGATGAGAGTGGCTTTCATACTTTTTTTTTTTTTAAGTTAACACCCTCCTTTACCCCCAGCAGTATC 

TGAGGTTATAGAATCAGAGATGCAGCAGTGACAAATGGCATTTTAACTTGTAAAATCGTGTGATGATGCT
TATCATTTTGAAATAGAAGAATAAAAACCTGGTCCCGTTTC 

GTTCTCTGAGAGTGAGTGTCTTGACATTTTCACCCAGGCCCTCCTGTCATCACATCACCGGCTGTCACTG 
GCGGGTGGCCGTAAACGTCCTGCGTTGCTATATTAGGATCTCTGCAGTTCAGGCTTGAAAACCAGTTCAG 
TGTATCCGGGCGACGGGTAGTGGTGGTGCATGCCTGTCTGTGTGCCCCGCTGGCGAGCTGTAGTTGCGGC 
TTGCGTGCCTCGCGGCCCACTACAGGGCTGCAGACAATCGAGGCGAGGGCGCTGGCCGCCAGCAGCTCAC
AGCGCGGGGGTCATGTGGTCGCTCCTCGAGGGTTTCGTTTTTGTTCTGCTTCATTAAGACTGGAATCAAG 
CTTACATGTAAACTATTGGTAATTTAAGTTTCCTTTTGTGTCATTCAGTGTAAAACTGTCTAATTTGAAA
AAAAATGTAGGTTATGAAAATAAAGATTTAGGCACTGTTC

SEQ ID NO:298 

>gi|20543999|ref|XP_040265.4| similar to no similarities to reported gene products [Homo sapiens]

MTSDQDAKVVAEPQTQRVQEGKDSAHLMNGPISQTTSQTSSIPPLSQVPATKVSELNPNAEVWGAPVLHL
EASSAADGVSAAWEEVAGHHADRGPQGSDANGDGDQGHENAALPDPQESDPADMNALALGPSEYDSLPEN
SETGGNESQPDSQEDPREVLKKTLEFCLSRENLASDMYLISQMDSDQYVPITTVANLDHIKKLSTDVDLI

VEVLRSLPLVQVDEKGEKVRPNQNRCIVILREISESTPVEVIEALFKGDNLPKFINCEFAYNDNWFITFE
TEADAQQAYKYLREEVKTFQGKPIKARIKAKAIAINTFLPKNGFRPLDVSLYAQQRYATSFYFPPMYSPQ 
QQFPLYSLITPQTWSATHSYLDPPLVTPFPNTGFINGFTSPAFKPAASPLTSLRQYPPRSRNPSKSHLRH 
AIPSAERGPGLLESPSIFNFTADRLINGVRSPQTRQAGQTRTRIQNPSAYAKREAGPGRVEPGSLESSPG
LGRGRKNSFGYRKKREEKFTSSQTQSPTPPKPPSPSFELGLSSFPPLPGAAGNLKTEDLFENRLSSLIIG

PSKERTLSADASVNTLPVVVSREPSVPASCAVSATYERSPSPAHLPDDPKVAEKQRETHSVDRLPSALTA
TACKSVQVNGAATELRKPSYAEICQRTSKEPPSSPLQPQKEQKPNTVGCGKEEKKLAEPAERYREPPALK
STPGAPRDQRRPAGGRPSPSAMGKRLSREQSTPPKSPQ 

SEQ ID NO:299

Novel (KIAA1583)

>GL2_89_2B04_G3F1 Direction: N/A

ACTGCAGGTGGCAGCACGGGCGCGCCGCTCGGGCTCATCTGACGCCTCTGCAGCGGGTTCCGCAGGCCTGCAGGG 
CGGGGAGGCCGGGACTGGGCGTCAGCGCTGAANGGGCCCAGCTGCCAGGGGGCCAGCTGCTGGAGAACCGCAGCT 
CGTCCCCGGCGGCTCCTAATCACCAGCAGCTCCTGTTTCTCAAACGCAGACATCCGCCCTCTTGGGGTCAGGCCC 


TTCCACCTGCAGTGCGAGCCGCCCAGCCACTCCCGACTGGCGCTATGCCTCGATCACCGCTCTTGCTCCCAAGTG 
GACCGCAGGGTGAGACGCTCTCTTACGTGGGACCCTGGGTGTGCGCTCACTCTCTGAAGGGCCTGGAAGCTAGAT 
TCCAGAGGCGTGGGCCAACTCTCCCTGGGTTTTTGGGTGAGCCCCCTCCGAGGGTGTTCATTTCCTGAGCTCTGT 
GTCATCTTAAGCTCTGAGGGTNCGNAGAAACCAGACAGTGGTTAGATAGATAAAGCGGGCGGTCGAATAATCTGA 
GGTCTGATACTCACTGACTGTCGTAAGGGCGAATTACGGGGCGNTAAAATTCNAATTGCCTATAGTGAGTCTGTA 
TTACAAATTCACTGGCGCGGTTTTACAAANGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCTTGA
GGAATCCCCCTTTCGCAGTGGGGTATAGGAGAAGGCGACGAATGCCTCCAACAGTGGGCAGCMAAGTACGACGGT 
AAGGGTANACCTATAAAGAAANCGGACGCGGNGGAGCACGGGAAAATACCGGAGAGAGTGNGACCCCTGCCAGAC
GGATTG 

SEQ ID NO:300

>GL2_202_3_M13F Direction: anti-sense

ACCCTCAGAGCCTAAGATGACACAGAGCTCAGGAAATGAACACCCTCGGAGGGGGCTCCCCAAAACCCAGGGGAG 
AGGTGGCCCACGCCTCTGGAATCTAGCTTCCAGGCCCTTTTTCAGAGAGTGAGCGCCCCCAGGGGTCCCCGTAAG 
AGAGCGTCTCCCCTGCGGTCCACTTGGGAGCAAGAGCGGTGATCGAGGCATAGCGCCAGTCGGGAGTGGGCTGGG 
GCGGCTCGCCTGCAGGGTGGAAGGGCCTGACCCCAAGAGGGGCGGATGTCTGCGTTTTGAGAAACAGGAGCTGCT
GGTGATTAGGAGCCGCCGGGGACGAGCTGCGGGTCT 

SEQ ID NO:301 

>GL2_202_3_M13R Direction: sense

ACTGCAGGTGGCAGCCACGGGGCGGCCGCCCCCGGGTCGGCTCATCTGACGCCTCTGCAGCGGGTTCCGCAGCCT
GCAGGGCGGGGAGGCCGGGACTGGCCGTCAGCGCTGAACGGCCCAGCCTGCCCAGGCCCAGCTGCTGGAGACCCG 
CAGCTCTGTCCCCGGCGGCTCCTAATCACCAGCAGCTCCTGTTTCTCAAACGCAGACATTCGCCCCTCTTGGGGT 
CAGGCCCTTCCACCTGCAGGCGAGCCGCCCAGCCACTCCCGACTGGCGCTATGCTC 

SEQ ID NO:302

>GL2_89_2B01_G3F1 Direction: sense 

ACTGNTAGNGTGGCAGCACGGGCGCGCCGCTCGGCCTCATCTGACGCCTCTGCAGCGGGTTCCGCAGNGCCTGCA 
GTGGGCGGGGAGTGCCGGGACTGGCCGTCAGCGCTGAACGGGCCCAGCTGGCCAGGGGCCAGCTGCTGGAGACCC 
GCAGCTCGTCCCCGGCGGWTCCTAATCACCAGCAGCTCCTGTTTCTCAAACGCAGACATTCTGGCCCTCTTGGGG 
TCAGTGCGCCTTCCACCTGCAGTGCGAGCCTGGCCCAGACCACTCCCGACTGGCGCTATGCCTCGATCACCGGTC
TTGCTCCCAAGTGGACCGCAGGGTGAGACGCTCTCTTACGGGGACCCTGGGGGCGCTCACTCTCTGAAGTGGCCT 
GGAAGCTATGATTCCAGAGGCGTGGGCCAACTCTCCCTGGGTTTTGGGGAGCCCCCTCCGAGGGTGTTCATTATC 
CTGAGCTCTGGTGTCATCTTAGGCTCTGAGGGTCCTGCAGAAACCAGGCACCAGTGGTTAGATAGATAAAGCTGG 
GCGGTCGACATAGATCTGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATTC 

SEQ ID NO:303

>GL2_89_2B02_G3F1 Direction: sense 

ACTGCAGGTGGCAGCCACGGGCGCGCCGCTCGGCCTCATCTGACGCCTCTGCAGCGGGTTCCGCAGGCCTGCAGG 
GCGGGGAGGCCGGGACTGGCCGTCAGCGCTGAACGGGCCCAGCCTGCCCAGGGCCCAGCTGCTGGAGACCCGCAG 
CTCGTCCCCGGCGGCTCCTAATCACCAGCAGCTCCTGTTTCTCAAACGCAGACATCCGCCCCTCTTGGGGTCAGG
CCCTTCCACCTGCAGNGCGAGCCGCCCCAGCCCACTCCCGACTGGCGCTATGCCTCGATCACCGCTCTTGCTCCC 
AAGTGGACCGCAGGGGAGACGCTCTCTTACGGGGACCCTGGGGGCGCTCACTCTCTGAAGGGCCTGGAAGCTAGA 
TTCCAGAGGCGTGGGCCACCTCTCCCTGGGTTTTGGGGAGCCCCCTCCGAGGGTGTTCATTTCCTGAGCTC 

SEQ ID NO:304

>GL2_89_2B03_G3F1 Direction: sense

CAGTGGTTAGATAGATAAAGCGGCCGCTCGACTAGTCTGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATT 
CGCGGCCGCTAAATTCAATTCGCCCTATAGTGAGTCGTATTACAATTCACTGGCCGTCGTTACAACGTCGTGACT 
GGGAAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCTTTCGCAGCTGGCGTAATAGCGAAGA 
GGCCCGACCGATCGCCTCCCAACAGTGCGCAGCTATACGTACGGGAGTAAGGGTNCACCTATNAAGAANAGCGAC
CGCCGGTGGGCGNCACCACAGGGACAANAATGGACGCACGANCAACGAACGTCACCCAGAACAGAANACTAGTCA 
AAAAANCCGAAAAACNGAGAGACTAAGGAAAANNCAAAACAAAAACACAAGAAAACCAACC 

SEQ ID NO:305 

>GL2_89_2B05_G3F1 Direction: N/A

ACTGCAGGTGGCAGCCACGGGCGCGCCGCTCGGCCTCATCTGACGCCTCTGCAGCGGGTTCCGCAGGCCTGCAGG 
GCGGGGAGGCCGGGACTGGCCGTCAGCGCTGAACGGCCCCAGCCTGCCCAGGGGCCCAGCTGCTGGAGACCCGCA 
GCTCGTCCCCGGCGGCTCCTAATCACCAGCAGCTCCTGTTTCTCAAACGCAGACATCCGCCCCTCTTGGGGTCAG 
GCCCTTCCACCTGCAGGCGAGCCGCCCCAGCCCACTCCCGACTGGCGCTATGCCTCGATCACCGCTCTTGCTCCC 

AAGTGGACCGCAGGGGAGACGCTCTCTTACGGGGACCCTGGGGGCGCTCACTCTCTGAAGGGCCTGGAAGCTAGA
TTCCAGAGGCGTGGGCCACCTCTCCCTGGGTTTTGGGGAGCCCCCTCCGAGGGTGTTCATTTCCTGAGCTCTGTG 
TCATCTTAGGCTCTGAGGGT 

SEQ ID NO:306

>GL2_89_2B06_G3F1 Direction: N/A

ACTGCANGTNGNAGCCACGGGCGCGCCGCGTCGGGCTCATCTGACGCCTCTTTCAGCGGGGTTCCGCAGTGCCTG 
CAGGGGCGGGGAGTGCCGGGACTGGGCGTCAGCGCTGAAAGGGCCCAGCTGTCCAGTGGGCCAGCTGCTGGAGAA 
ACCGCGAGGNGTCTGGTTCCCGGCGGGGTCCCTAAGTNACCAGCAGAGNGTCCTGGTTTCTCAAAACGCAGACAT 
TCTGGCCCTCTTGGGGGTCAGGGCCTTTCACCTGCAGGCGAGCCCGCGCCCAGNGGCCGACGGTCCCCGAAACAT 

GGGGCGGCGTAATTGTCCTTCTCGAATCTAAACAGCGGTCC

SEQ ID NO:307

Clone 89 -Novel (KIAA1583)> 

ACTGCAGGTGGCAGCCACGGGCGCGCCGCTCGGCCTCATCTGACGCCTCTGCAGCGGGTTCCGCAGGCCTGCAGG 
GCGGGGAGGCCGGGACTGGCCGTCAGCGCTGAACGGCCCAGCCTGCCCAGGGCCCAGCTGCTGGAGACCCGCAGC
TCGTCCCCGGCGGCTCCTAATCACCAGCAGCTCCTGTTTCTCAAAACGCAGACATCCGCCCCTCTTGGGGTCAGG 
CCCTTCCACCCTGCAGGCGAGCCGCCCCAGCCCACTCCCGACTGGCGCTATGCCTCGATCACCGCTCTTGCTCCC 
AAGTGGACCGCAGGGGAGACGCTCTCTTACGGGGACCCTGGGGGCGCTCACTCTCTGAAGGGCCTGGAAGCTAGA 
TTCCAGAGGCGTGGGCCACCTCTCCCTGGGTTTTGGGGAGCCCCCTCCGAGGGTGTTCATTTCCTGAGCTCTGTG 
TCATCTTAGGCTCTGAGGGT

SEQ ID NO:308

>gi|10047240|dbj|AB046803.1| Homo sapiens mRNA for KIAA1583 protein, partial cds

GTGGTGAGAGGGGAAGGGACATTTACAGCCCTTTTCTCCCTCCCTTGCCCACAACTACTGTGCCCCAGCA
TGGCACCCGGGTCAGCCCCTGCCAGCTCTGGAGCTGTCATCATCTCTCCCTCTTATGCCTCTTCAGTGGA 
CTGTGGCCAGGCTCCCCTGGACCCTGTCTACCTGCCGGCAGCCCTGGAGCTCCTAGACGCCCCTGAACAC 
TTCCGTGTGCAGCAGGTGGGCCACTACCCACCTGCCAACTCCTCTCTGAGCTCCCGATCTGAGACCTTTC 
TGCTCCTACAGCCCTGGCCCAGGGCCCAGCCACTTCTCCGGGCCTCCTACCCACCTTTTGCCACTCAGCA 

GGTGGTCCCCCCTCGAGTCACTGAGCCCCACCAACGGCCAGTCCCATGGGACGTGCGGGCCGTTTCAGTG
GAAGCGGCTGTGACTCCAGCAGAGCCCTACGCCCGGGTTCTCTTCCACCTCAAAGGGCAGGATTGGCCAC 
CAGGGTCTGGCAGCCTGCCCTGTGCCCGGCTCCATGCCACACACCCTGCCGGCACTGCTCACCAAGCCTG 
CCGCTTCCAGCCATCCCTGGGCGCCTGCGTGGTGGAGCTGGAGCTTCCCTCGCACTGGTTCTCACAGGCC 
TCCACCACACGGGCCGAGCTGGCCTACACGCTTGAGCCTGCAGCTGAGGGCCCTGGGGGCTGTGGCTCCG 

GCGAGGAGAACGACCCTGGGGAGCAGGCCCTCCCAGTGGGGGGTGTGGAGCTGCGCCCAGCAGACCCCCC
GCAGTACCAGGAGGTACCTCTGGACGAGGCTGTGACTCTGCGGGTGCCTGACATGCCAGTGCGGCCCGGC
CAGCTCTTTAGTGCTACCCTCCTGCTTCGGCACAACTTCACAGCCAGCCTCCTGACCCTGCGGATCAAGG 
TGAAGAAGGGGCTGCATGTGACAGCCGCCCGCCCAGCCCAGCCCACACTCTGGACTGCCAAGCTGGACCG 
CTTCAAGGGCTCCAGGCACCACACCACCCTCATCACCTGCCACCGTGCTGGGCTCACAGAGCCAGATTCC 
AGTCCCCTTGAACTGTCTGAGTTCCTATGGGTGGACTTTGTGGTGGAGAATAGCACTGGTGGGGGCGTAG
CGGTCACTCGCCCCGTCACGTGGCAGCTGGAGTACCCAGGCCAGGCCCCTGAAGCAGAGAAGGACAAAAT 
GGTGTGGGAAATCCTGGTGTCTGAGCGGGACATCAGAGCCCTTATCCCACTGGCCAAGGTAAGGAGACCT 
CCATCTCTGCCTGGGAAGGCTGGAGGCCAAGTCCCAGGAGCCCCATGAGCTGAGGGGCCTGGGGCTGAGC 
CCCTCCTCCACCCCCAGGCTGAGGAGCTGGTGAATACAGCACCACTGACTGGAGTGCCCCAGCATGTCCC 

CGTGCGCCTTGTCACTGTGGACGGCGGGGGGGCCTTGGTGGAGGTGACAGAGCATGTCGGCTGCGAGTCT
GCCAACACACAGGTCCTGCAGGTGAGTGGCAGGTGCCCAGCTCATGTGAGTCTGCATTTGTGTGGGCACA 
GTTACGCACACACACCTTTCCTCCTTCCTCATGTGGGTCCCCAAGAGAACAGCTTCTTCTTGAGACGAGA 
AACCTCATGGGAGGAACCACAGCTTCAACTTCTTTTGTTCCAGCCACCTCTCTCCCGAGTCAAGTAACCC 
TAACAGCCCCCCGACCTTAGACACTTTCACGCTCATTATTGCACTGGCAAGTGCGCTGGCCTGAGAGAGT 

CGAGAGCACATACTCTGGGATCAGAAAGTTTTAGTTTGAAACACATCTTTGTTGTCAACCAGCTTTGTGA
TCATGGGCTGGTCTCTTAACCTCTCTGAGCCTCGTTACCTAAACTTTAAAATGAGAACATATCTTCGGGT 
TGTTGTAAGAATTAAAAGCACTTTAAGACAGTATTTAGCAGATAGCAGGGGCTGAAACAATGGCAGTTAT 
TGCTGTTATAATATGATGTAATAATTATACAATGGTAAAAATTATTATTATTATTATTACCGGTGATTGT 
GAAGTGAAATAAAAGGGAGGGTGGTAGAGCAACTTCCTCTATCTTGAAGTTCTAAGAGGGGAAGCATGGA 

GTCTGCATGTATGCAAATCCCAGCCACCTGAAGGGGGAAGGTGCCTTTCCCCACTTGTGGTCCTTTGCTC
AAATGTGTGTGGTTCCTGCTGGAGGGCCTTGGTGGGAGATCTGCTTTGTCCCACCTCTGCCTACCTGGCT 
GTAGCTGGTTGCCAGGTGGCTCCAAGAGCAATGTCAAAGGCCAGCATGAATAAGGAGCCAAAGTTTTGGG 
TTCACACCCCTGCTCTGCCAGGAGGAAGACTGGTATGCAATGGCCAGTTCCCCTGACTGAGCCCCAGGGA 
ACCTCAAGGAATGGCCCCTACAGTAAAATCTTAAACAAAGAACAGGGGCTTAACCAACAATCCTGCTGCA 

ACATGCATTCCAGTGACACACAGAGCTGAACTCAGTGGGAGCCCGTCCTACGGCCTGAGGTCAGGGCTGC

AGGCTGGAGTGCAATGGTACAGTCCGGCTCACTGCAGTCTCAGCTACCCAGGCTCAAGGGATCCTCCCAC 
CTCAGCCTCCCCAGTAGCTAGGACTACAGGCATGTGCCACCACACCCGGCTGATTTATTGTATTTTTTGT 
AGAGAGGGAGTTTTGCCATGTTGCCCAGGCTGGTCTCAAACTCCTGGACTCAAGCGATCCACCCACCTCG 

GCCTCTCAGAGTGCTGGGACTACAGGCATGAGCCACCCACCCGGCCCAGCCTCACACCTGTCTTTAGTGG
CTTAGCCCGCCCACCTGCCCTCTGCCCTTCCATAGGTGTCTGAGGCCTGTGATGCCGTGTTCGTGGCTGG 
CAAGGAGAGCCGGGGCGCCCGGGGGGTGCGAGTGGACTTCTGGTGGCGCCGGCTCCGCGCCTCGCTGCGG 
CTGACCGTGTGGGCCCCCCTGCTACCGCTGCGTATCGAGCTCACCGACACCACCCTCGAGCAGGTCCGCG 
GCTGGAGGGTACCTGGCCCTGCTGAAGGGTGAGTGGAGGCCTGAGGAAGCTGGCAGGCCTTGGCGAGTCA 

CACTGGGACGGAGAGGGCGCAGGGGACACAGCCATGGGCCCAAGTCGGGGTCAGAAAAACTGCCCTGTTG
CTTGCAGTTCCTTCTGCCCACCTCCACCCTGGGCCGCCCAGCCAATGATCGCTTTCTGTGAAATTGCTGA 
CTTCCGCATCCCTCTCCTCCTTCCCTGCCTCACTTCTCCCGTAGCACCCTCCACTTCCTAATCTACGTCA 
GTACTGGTTAATGTTGCCTGTCCATCCTTTCCACGCTCCTCCCCCGAGAATGTCAGCCCCTTGAAGCTGC 
GGTCTGTCTATGCTGGTCGTACCCTCAGAGCCTAAGATGACACAGAGCTCAGGAAATGAACACCCTCGGA 

GGGGGCTCCCCAAAACCCAGGGAGAGGTGGCCCACGCCTCTGGAATCTAGCTTCCAGGCCCTTCAGAGAG
TGAGCGCCCCCAGGGTCCCCGTAAGAGAGCGTCTCCCCTGCGGTCCACTTGGGAGCAAGAGCGGTGATCG 
AGGCACAGCGCCAGTCGGGAGTGGGCTGGGGCGGCTCGCCTGCAGGTGGAAGGGCCTGACCCCAAGAGGG 
GCGGATGTCTGCGTTTGAGAAACAGGAGCTGCTGGTGATTAGGAGCCGCCGGGGACGAGCTGCGGGTCTC 
CAGCAGCTGGGCCCTGGGCAGGCTGGGGCCGCTCAGCGCTGACGGCCAGTCCCGGCCTCCCCGCCCTGCA

GGCCTGCGGAACCCGCTGCAGAGGCGTCGGATGAGGCCGAGCGGCGCGCCCGTGGCTGCCACCTGCAGTA
CCAGCGGGCCGGTGTGCGCTTCCTCGCCCCCTTCGCGGCCCACCCGCTGGACGGCGGCCGC 

SEQ ID NO:309 

>gi|10047241|dbj|BAB13409.1| KIAA1583 protein [Homo sapiens]
VVRGEGTFTALFSLPCPQLLCPSMAPGSAPASSGAVIISPSYASSVDCGQAPLDPVYLPAALELLDAPEH
FRVQQVGHYPPANSSLSSRSETFLLLQPWPRAQPLLRASYPPFATQQVVPPRVTEPHQRPVPWDVRAVSV
EAAVTPAEPYARVLFHLKGQDWPPGSGSLPCARLHATHPAGTAHQACRFQPSLGACVVELELPSHWFSQA
STTRAELAYTLEPAAEGPGGCGSGEENDPGEQALPVGGVELRPADPPQYQEVPLDEAVTLRVPDMPVRPG
QLFSATLLLRHNFTASLLTLRIKVKKGLHVTAARPAQPTLWTAKLDRFKGSRHHTTLITCHRAGLTEPDS
SPLELSEFLWVDFVVENSTGGGVAVTRPVTWQLEYPGQAPEAEKDKMVWEILVSERDIRALIPLAKVRRP
PSLPGKAGGQVPGAP 
SEQ ID NO:310

Novel (KIAA1814) 
>GL2_75_2_M13F Direction: sense 

CTGGACGGCCTGGCTGGGCTGAAGGGCGAGGACAGCCGCAGCAAGGAGGCAGGGGAGGGCGGCCTACCGCTGTGC 
GGGCCCACGGACAAGACCCCACTGCTGAGCGGCAAGGCCGCCAAGGCCCGGGACCGCGAG

SEQ ID NO:311

>GL2_75_2_M13R Direction: anti-sense

CTGCCCATGGCGTGCGAGTTGGTCAGTCTGGTGGCAGCGGACGACACGTGCACCATGGCCTGCGGCGGCCGGCAC 
AGAGCTGAACAGCGACTGCAGCACGGAGGAGCCGGCCACGGAGCCGAGGTTGGCCTGCAGGGACATGGGGCCGAG
CGCGAACTGCGGTCCCGGCGGGAAGGGGCCCAGGAAGGACCGGTGCGTCTGGGCGGAGGACGCCGCGCCGCCTGC 
GGAGGACGCCGCCGGGGCCAGGCCGGGGCCGCTGAGGAGGCTTCCGGGAGGCACGGCCGCCGCAGAGATGAAGAG 
GTTGTGGCCATTCTTGAGGTCGACCTCGCGGTCCCGGGCCTTGGCGGCCTTGCCGCTCAGCAGTGGGGTCTTGTC 
CGTGGGCCCGCACAGNGGTAGGCCGCCTCCCCTGCCTCCTTGCTGCGGCTGTCCTCGCCCTTCAGCCCAGCCAGG 
CCGTCCAG

SEQ ID NO:312

>GL2_75_3_M13R Direction: anti-sense

CTGCCCATGGCGTGCGAGTTGGTCAGTCTGGTGGCAGCGGACGACACGTGCACCAGGCCTGCGGGGCGGCACAGA 
GCTGAACAGCGACTGCAGCACGGAGGAGCCGGCCACGGAGCCGAGGTTGGCCTGCAGGGGACATGGGGCCGAGCG
CGAACTGCGGTCCCGGCGGGAAGGGGCCCAGGAAGGACCGGTGCGTCTGGGCGGAGGACGCCGCGCCGCCTGCGG 
AGGACGCCGCCGGGGCCAGGCCGGGGCCGCTGAGGAGGCTTCCGGGAGGCACGGCGCCGCAGAGATGAAGAGGTT 
GTGGCCATTCTTGAGGTCGACT 

SEQ ID NO:313

>GL2_81A_D07_G3F1 Direction: N/A

CTGGACGGCCTGGCTGGGCTGAAGGGCGAGGACAGCCGCAGCAAGGAGGCAGGGGAGGGCGGCCTACCGCTGTGC 
GGGCCCACGGACAAGACCCCACTGCTGAGCGGCAAGGCCGCCAAGGCCCGGGACCGCGAGGTCGACCTCAAGAAT 
GGCCACAACCTCTTCATCTCTGCGGCGGCCGTGCCTCCCGGAAGCCTCCTCAGCGGCCCCGGCCTGGCCCCGGCG 
30 GCGTCCTCCGCAGGCGGCGCGGCGTCCTCCGCCCAGACGCACCGGTCCTTCCTGGGCCCCTTCCCGCCGGGACCG 
CAGTTCGCGCTCGGCCCCATGTCCCTGCAGGCCAACCTCGGCTCCGTGGCCGGCTCCTCCGTGCTGCAGTCGCTG 
TTCAGCTCTGTGCCGGCCGCCGCAGGCCTGGTGCACGTGTCGTC 

SEQ ID NO:314

>GL2_81A_D08_G3F1 Direction: N/A

CTGGACGGCCTGGCTGGGCTGAAGGGCGAGGACAGCCGCAGCAAGGAGGCAGGGGAGGGCGGCTACCGCTGTGCG 
GGCCCACGGACAAGACCCCACTGCTGAGCGGCAAGGCCGCCAAGGCCCGGGACCGCGAGGTCGACCTCAAGAATG 
GNCACAACCTCTTCATCTCTGCGGCGGCGTGCCTCCCGGAAGCCTCCTCAGCGGCCCGGCCTGGCCCCGGCGGCG 
TCCTCCGCAGGCGGCGCGGCGTCCTCCGCCCAGACGCACCGGTCCTTCCTGGGCCCCTTCCCGCCGGGACCGCAG 

TTCGCGCTCGGCCCATGTCCCTGCAGGCCAACCTCGGCTCCGTGGCCGGCTCCTCCGTGCTGCAGTCGCTGTTCA
AGCTCTGTGCCTGGCCTGCCGCAGGCCTGGTGCACGTGTCGTCCGCTGCCANACAGACTGACCAACTCGCACGCC 
ATGGGCAG 

SEQ ID NO:315

>GL2_81A_D11_G3F1 Direction: N/A

CTGGACGGCTGGCTGGGCTGAAGGGCGAGGACAGCGCAGCAAGGAGGCAGGGGAGGGCGGCTACCGCTGTGCGGG 
CCCACGGGACAAGACCCGACTGCTGAGCGGCAAGGCCGCAAGGCCCGGGACCGCGAGGTCTGACCTCAAGAATGG
GCACAACTCTTCATCTCTGCGGCGGNCTGTGCCTCCCGGAAGCCTCCTCAGCGGCCCGGCCTGGCCCCGGCTGGC
TGTCCTCCGCAGTGCTGGCTGCGGCGTCCTCCGCCCAGACGCACCGGTCCTTCCTGGGCCCCTTCCCGCCGGGAC 

CGCAGTTCGCGCTCGGCCCATGTCCCTGCAGGCCAACTCGGCTCCGTGGCCGGCTCCTCCGTGCTGCAGTCGCTG
TTCAGCTCTGTGCCGGCCGCCGCAGGCCTGGTGCACGTGTCGTCCGCTGCCACCAGACTGACCAACTCGCACGCC 
ATGGGCAG 

SEQ ID NO:316

>GL2_81A_D12_G3F1 Direction: sense

CTGGACGGCCTGGCTGGGCTGAAGGGCGAGGACAGCCGCAGCAAGGAGGCAGGGGGAGGGCGGCTACCGCTGTGC 
GGGCCCACGGACAAGACCCCACTGCTGAGCGGCAAGGCCGCCAAGGCCCGGGACCGCGAGGTCGACCTCAAGAAT 
GGGCACAACTCTTCATCTCTGCGGCGGCCGTGCCTCCCGGAAGCCTCCTCAGCGGCCCGGCCTGGCCCCGGCGGC 
GTCCTCCGCAGGCGGCGCGGCGTCCTCCGCCCAGACGCACCGGTCCTTCCTGGGCCCCTTCCCGCCGGGACCGCA 

GCTCGCGCTCGGCCCATGTCCCTGCAGGCCAACCTCGGCTCCGTGGCCGGCTCCTCCGTGCTGCAGTCGCTGTTC
AGCTCTGTGCCGGCCGCCGCAGGCCTGGTGCACGTGTCGTCCGCTGCCACCAGACTGACCAACTCGCACGCCATG
GGCAG 

SEQ ID NO:317

Clone 75 -Novel (KIAA1814)>

CTGGACGGCCTGGCTGGGCTGAAGGGCGAGGACAGCCGCAGCAAGGAGGCAGGGGAGGGCGGCCTACCGCTGTGC 
GGGCCCACGGACAAGACCCCACTGCTGAGCGGCAAGGCCGCCAAGGCCCGGGACCGCGAGGTCGACCTCAAGAAT 
GGCCACAACCTCTTCATCTCTGCGGCGGCCGTGCCTCCCGGAAGCCTCCTCAGCGGCCCGGCCTGGCCCCGGCGG 
CGTCCTCCGCAGGCGGCGCGGCGTCCTCCGCCCAGACGCACCGGTCCTTCCTGGGCCCCTTCCCGCCGGGACCGC 
AGTTCGCGCTCGGCCCCATGTCCCTGCAGGCCAACCTCGGCTCCGTGGCCGGCTCCTCCGTGCTGCAGTCGCTGT
TCAGCTCTGTGCCGGCCGCCGCAGGCCTGGTGCACGTGTCGTCCGCTGCCACCAGACTGACCAACTCGCACGCCA 
TGGGCAG 

SEQ ID NO:318

>gi|22053031|ref|XM_046822.3| Homo sapiens KIAA1814 protein (KIAA1814), mRNA

GCGGCCGTCCACTGGACTCCTGCGCCATATCCTGCAGCAGGTCTACAACCACTCGGTGACCGACCCCGAG 
AAGCTCAACAACTACGAGCCCTTCTCCCCCGAGGTGTACGGGGAGACCTCCTTCGACCTGGTGGCCCAGA 
TGATTGATGAGATCAAGATGACCGACGACGACCTGTTTGTGGACTTGGGGAGCGGTGTGGGCCAGGTCGT 

GCTCCAGGTTGCTGCTGCCACCAACTGCAAACATCACTATGGCGTCGAGAAAGCAGACATCCCGGCCAAG
TATGCGGAGACCATGGACCGCGAGTTCAGGAAGTGGATGAAATGGTATGGAAAAAAGCATGCAGAATACA 
CATTGGAGAGAGGCGATTTCCTCTCAGAAGAGTGGAGGGAGCGAATCGCCAACACGAGTGTTATATTTGT 
GAATAATTTTGCCTTTGGTCCTGAGGTGGATCACCAGCTGAAGGAGCGGTTTGCAAACATGAAGGAAGGT 
GGCAGAATCGTGTCCTCGAAACCCTTTGCACCTCTGAACTTCAGAATAAACAGTAGAAACTTGAGTGACA 

TCGGCACCATCATGCGCGTGGTGGAGCTCTCGCCCCTGAAGGGCTCGGTGTCGTGGACGGGGAAGCCAGT
CTCCTACTACCTGCACACTATCGACCGCACCATACTTGAAAACTATTTTTCTAGTCTGAAAAACCCAAAA 
CTCAGGGAGGAACAGGAGGCAGCCCGGCGCCGCCAGCAGCGCGAGAGCAAGAGCAACGCGGCCACGCCCA 
CTAAGGGCCCAGAGGGCAAGGTGGCCGGCCCCGCCGACGCCCCCATGGACTCTGGTGCTGAGGAAGAGAA 
GGCGGGAGCAGCCACCGTGAAGAAGCCGTCTCCCTCCAAAGCCCGCAAGAAGAAGCTAAACAAGAAGGGG 

AGGAAGATGGCTGGCCGCAAGCGCGGGCGCCCCAAGAAGATGAACACTGCGAACCCCGAGCGGAAGCCCA
AGAAGAACCAAACTGCACTGGATGCCCTGCACGCTCAGACCGTGTCTCAGACGGCGGCCTCCTCACCCCA
GGATGCCTACAGATCCCCTCACAGCCCGTTCTACCAGCTACCTCCGAGCGTGCAGCGGCACTCCCCCAAC 
CCGCTGCTGGTGGCGCCCACCCCGCCCGCGCTGCAGAAGCTTCTAGAGTCCTTCAAGATCCAGTACCTGC 
AGTTCCTGGCATACACAAAGACCCCCCAGTACAAGGCCAGCCTGCAGGAGCTGCTGGGCCAGGAGAAGGA 

GAAGAACGCCCAGCTCCTGGGTGCGGCTCAGCAGCTCCTCAGCCACTGCCAGGCCCAGAAGGAGGAGATC
AGGAGGCTGTTTCAGCAAAAATTGGATGAGCTGGGTGTGAAGGCGCTGACCTACAACGACCTGATTCAAG 
CGCAGAAGGAGATCTCCGCCCATAACCAGCAGCTGCGGGAGCAGTCGGAGCAGCTGGAGCAGGACAACCG 
CGCGCTCCGCGGCCAGAGCTTGCAGCTGCTCAAGGCTCGCTGCGAGGAGCTGCAGCTGGACTGGGCCACG 
CTGTGGCTGGAGAAGCTGTTGAAGGAGAAGCAGGCCCTGAAGAGCCAGATCTCGGAGAAGCAGAGGCACT 

GCCTGGAGCTGCAGATCAGCATTGTGGAGCTAGAGAAGAGCCAGCGGCAGCAGGAGCTCCTGCAGCTCAA
GTCCTGTGTGCCGCCTGACGACGCCCTGTCCCTGCACCTGCGTGGGAAGGGCGCCCTGGGCCGCGAGCTG 
GAGCCTGACGCCAGCCGGCTGCACCTGGAGCTGGACTGCACCAAGTTCTCGCTGCCTCACTTGAGCAGCA 
TGAGCCCGGAGCTCTCCATGAACGGCCAGGCTGCTGGCTATGAGCTCTGCGGTGTGCTGAGCCGGCCTTC 
GTCGAAGCAGAACACGCCCCAGTACCTGGCCTCACCCCTGGACCAGGAGGTGGTGCCCTGTACCCCTAGC 

CACGTCGGCCGGCCGCGCCTGGAGAAGCTGTCTGGCCTAGCCGCACCCGACTACACTAGGCTGTCCCCGG
CCAAGATTGTGCTGAGGCGGCACCTGAGCCAGGACCACACGGTGCCCGGCAGGCCGGCTGCCAGTGAGCT 
GCATTCGAGAGCTGAGCACACCAAGGAGAACGGCCTTCCCTACCAGAGCCCCAGCGTGCCTGGCAGCATG 
AAGCTGAGCCCTCAGGACCCGCGGCCCCTGTCCCCTGGGGCCTTGCAGCTTGCTGGAGAGAAGAGCAGTG 
AGAAGGGCCTGAGAGAGCGCGCCTACGGCAGCAGCGGGGAGCTCATCACCAGCCTGCCCATCAGCATCCC 

GCTCAGCACCGTGCAGCCCAACAAGCTCCCGGTCAGCATTCCCCTGGCCAGCGTGGTGCTGCCCAGCCGC
GCCGAGAGGGCGAGGAGCACCCCCAGTCCCGTGCTGCAGCCCCGTGACCCCTCGTCCACACTTGAAAAGC 
AGATTGGTGCTAATGCCCACGGTGCTGGGAGCAGAAGCCTTGCCCTGGCCCCCGCAGGCTTCTCCTACGC 
TGGCTCGGTGGCCATCAGCGGGGCCTTGGCGGGCAGCCCGGCCTCTCTCACACCTGGAGCCGAGCCGGCC 
ACCTTGGATGAGTCCTCCAGCTCTGGGAGCCTTTTTGCCACCGTGGGGTCCCGCAGCTCCACGCCACAGC 

ACCCCCTGCTGCTGGCACAGCCCCGGAACTCGCTTCCTGCCTCTCCCGCCCACCAGCTCTCCTCCAGTCC
CCGGCTTGGTGGGGCCGCCCAGGGCCCGTTGCCCGAGGCCAGCAAGGGAGACCTGCCCTCCGATTCCGGC 
TTCTCAGATCCTGAGAGTGAAGCCAAGAGGAGGATTGTGTTCACCATCACCACTGGTGCGGGCAGTGCCA 
AGCAGTCGCCCTCCAGCAAGCACAGCCCCCTGACCGCCAGCGCCCGTGGGGACTGTGTGCCGAGCCACGG 
GCAGGACAGTCGCAGGCGCGGCCGGCGGAAGCGAGCATCTGCGGGGACGCCCAGCTTGAGCGCAGGCGTG 

TCCCCCAAGCGCCGAGCCCTGCCGTCCGTCGCTGGCCTTTTCACACAGCCTTCGGGGTCTCCCCTCAACC
TCAACTCCATGGTCAGTAACATCAACCAGCCCCTGGAGATTACAGCCATCTCGTCCCCGGAGACCTCCCT 
GAAGAGCTCCCCTGTGCCCTACCAGGACCACGACCAGCCCCCCGTGCTCAAGAAGGAGCGGCCTCTGAGC
CAGACCAATGGGGCACACTACTCCCCACTCACCTCAGACGAGGAGCCAGGCTCTGAGGACGAGCCCAGCA 
GTGCTCGAATTGAGAGAAAAATTGCAACAATCTCCTTAGAAAGCAAATCTCCCCCGAAAACCTTGGAAAA 
TGGTGGTGGCTTGGCGGGAAGGAAGCCCGCGCCCGCCGGCGAGCCAGTCAATAGCAGCAAGTGGAAGTCC 
ACCTTCTCGCCCATCTCCGACATCGGCCTGGCCAAGTCGGCGGACAGCCCGCTGCAGGCCAGCTCCGCCC
TCAGCCAGAACTCCCTGTTCACGTTCCGGCCCGCCCTGGAGGAGCCCTCTGCCGATGCCAAGCTGGCCGC 
TCACCCCAGGAAAGGCTTTCCCGGCTCCCTGTCGGGGGCTGACGGACTCAGCCCGGGCACCAACCCTGCC 
AACGGCTGCACCTTCGGCGGGGGCCTGGCCGCGGACCTGAGTTTACACAGCTTCAGTGATGGTGCTTCTC 
TTCCCCACAAGGGCCCCGAGGCGGCCGGCCTGAGCTCCCCGCTGAGCTTCCCCTCGCAGCGCGGCAAGGA 

GGGCTCGGACGCCAACCCTTTCCTGAGCAAGAGGCAGCTGGACGGCCTGGCTGGGCTGAAGGGCGAGGGC
AGCCGCGGCAAGGAGGCAGGGGAGGGCGGCCTACCGCTGTGCGGGCCCACGGACAAGACCCCACTGCTGA 
GCGGCAAGGCCGCCAAGGCCCGGGACCGCGAGGTCGACCTCAAGAATGGCCACAACCTCTTCATCTCTGC 
GGCGGCCGTGCCTCCCGGAAGCCTCCTCAGCGGCCCCGGCCTGGCCCCGGCGGCGTCCTCCGCAGGCGGC 
GCGGCGTCCTCCGCCCAGACGCACCGGTCCTTCCTGGGCCCCTTCCCGCCGGGACCGCAGTTCGCGCTCG 

GCCCCATGTCCCTGCAGGCCAACCTCGGCTCCGTGGCCGGCTCCTCCGTGCTGCAGTCGCTGTTCAGCTC
TGTGCCGGCCGCCGCAGGCCTGGTGCACGTGTCGTCCGCTGCCACCAGACTGACCAACTCGCACGCCATG 
GGCAGCTTTTCCGGGGTGGCAGGCGGCACAGTTGGAGGTGTCTTTAACCACGCGGTGCCCTCCGCCTCTG 
CTCATCCGTTTGGAGCCCGTGTCGGCCGCGGGGCTGCATGTGGCAGCGCCACACTGGGCCCGAGCCCGCT 
GCAGGCGGCGGCCAGCGCCTCGGCCTCTTCCTTTCAGGCCCCGGCCTCGGTTGAGACCCGGCCGCCCCCT 

CCGCCTCCGCCTCCGCCTCCCCCGCTGCCCCCGCCTGCGCACCTGGGCCGGTCCCCCGCGGGGCCGCCCG
TCCTCCACGCCCCCCCTCCACCTAACGCCGCCTTGCCTCCTCCCCCAACGCTGCTGGCCTCTAACCCTGA 
GCCCGCGCTTCTGCAGAGCCTCGCGTCCCTCCCGCCTAACCAAGCTTTCTTGCCCCCCACCTCTGCTGCC 
TCTCTGCCGCCTGCTAACGCCTCTTTGTCTATCAAGCTCACCTCCCTCCCGCACAAGGGCGCCCGCCCCT 
CCTTCACGGTGCACCACCAGCCCCTGCCCCGGCTGGCCCTGGCCCAGGCCGCGCCCGGGATCCCACAGGC 

CAGCGCCACGGGGCCGTCCGCGGTGTGGGTGTCCCTCGGCATGCCGCCTCCCTATGCCGCGCACCTTTCG
GGGGTTAAGCCGCGATAAAGACCTTGCTTAGCTAGCAGTGCGTATTGTGTAAGGTAAGGCCAGAGCCCTG 
CGCGGTGGCTCACTCCAGACACTGAACTGTCCTTCCTTTGGCAGAGAAGACGGCCACAGGGCTCGGCAGA 
AGTTCCGGGGGTCCTGAGGAGGGAAGAGGGAGAGAGCCAGGGAGACCAGGGAGATGGTCGCGAGAGGAGG 
GACTACAGGACGGGCACCAGCCATGGAGTGGCCTGAGTGTGTTGGGTTCCAGCTGCCCGCAGCTGGGTTC 

CTGCGGTGACGCCGCAGGACTGAGGCCAGCCCTGGGGGCAGCTGTGCCAGCCCCTGTGGGCGAGGCTCAC
GGGGTGCCAGGCCAGGGGGCTGGGAGCTCTAGCTCCTAGAGCAGTCTCCAGGGCTCCATCCGCACCAGGC 
CCAGGTCACTCATGACGGGTGGCGTGGCTTGGTGCTGTTCCCCAGGCGCCCAGCATGTAGCAGGCACGGT 
GCCGGCTGCAGAGGTCCTGGAGAGCCAGGGTGGCTGGCTGGATGCCTCTGGTCGGGGCTGTCTGGCCAGT 
AGCCTCGGATGCCCTCATAGGGAGCCAGTGAAGCCCCAAGTGCCCTGCCTGCCTGGGGGCTGATGGGTTC 

CCGGCGGGGTCGAGAGGCAAGCTCTGTGCCCTGCGTGTTGAGGCCCCCTTGCTGGACACGGCTGATGGCC
ACACTGCCCTGTGGTCATGGCCCACAGCCGAGGGGCTTTCAGCTGCTCAGATGTCAGCACCAAGCTGGGC 
AGGTTAGTGTAGACGGTGCCCACCTTTGAGACCAGAAGGAAGTTGGAAGCAGGAGTGATTTTGATGATGT 
GAGGCCTTCAGGGAATGTCCTGTTCACTCCTCCCCCAAGAGGCTGTGGAGACTGTGGCCGCTGACCCCTG 
AGGGCAGTTGGCCACCCGTGCCCTCCAGGGACATGGCGTGCCTGGCGGCGTAGTGCAAAGGTGCCCTCTG 

CTTGCCCCTGGCCTTCTGCCTCAGGGGCCCCTGGGACACAGGCCTTCCTGGGCGAGTCCAAAGCCCACCG
GGCAAGGAGAGGAGGACTGGGAGCCAGGAGCCAAGGCGGAGGCTGTGGCCAGGGCTGGTCAGCCTGGCGG 
GTCAATGCGGGCACCTGCGGGCTGGACAAGCTATGCTGGGTCTCTTAGGAGATGAGCTGCAGGTAGGGTG 
GCTTTAGCTGGACTCGGCTGTGTGTCTCCTTTGGAGGAAGGCTGGAGCTTGGCCGGCGTGTCCAGGTGTC 
AGGCTCCCTGGTCTGTCACGGGGCACGCCCGTCGTCTGTTGTGGTTAGAGGAGCTGGCACTATGCCTGTC 

ATGGTGCTGGCCCCAGGTGGCGTGTGTGCTCGTGGGAGGCCTCGGTCCCCAGCCTGGGTGAGTGTTGGGC
TCCCCCAGGCGCGATGGTAACCTCAGGCCGCTCTTCCCGCTGTGCCCTTCTGCAGGTAACTAGGATTTCT 
ACCTCAACCGCGAGACCTATGCAAGGACGGTGTGGACCAACTCGCGCCCGCGGCATGGTGCCCGCCGGCC 
TGCCGGGCTCCCACCCCTGGACGGCAGAGGCAAGGACGGACGGGAGCTCCACTGTGAATCGGCGGCACGC 
GCCGCAGGAGGCTGGGACTGGTCCAGTTTGTACTGTCGATAGTTTTAGATAAAGTATTTATCATTTTTT 



SEQ ID NO:319

>gi|22053032|ref|XP_046822.3| similar to FLJ00192 protein [Homo sapiens]

MIDEIKMTDDDLFVDLGSGVGQVVLQVAA 

TIjERGDFLSEEWRERIAlTOSVIFVl^FAFGPEVDHQIiKERFANMKEGGRIVSSKPFAPL 
55 IGTIMRVVELSPLKGSVSWTGKPVSYYLHTIDRTILI^ 

TKGPEGKVAGPADAPMDSGAEEEKAGAATVKKP S P S KARKKKIjNKKGRKMAGRKRGRPKKMNTANPERKP 

KKNQTALDALHAQTV S QTAAS S PQDAYRS PHS PFYQIi PPS VQRHS PNPLLVAPTPPALQKLLE S FKIQYL 

QFLAYTKTPQYKASLQELLGQEKEKNAQLLGAAQQL^ 

AQKE I S AHNQQLREQS EQLEQDNTIALRGQSLQLLKARC^ 
60 CLELQISIVEIiEKSQRQQELIiQLKSCVPPDDALSLH^ 

MS PELSMNGQAAGYEIiCGVIiSRPS SKQNTPQYIxAS PLDQEWPCTPSHVGRPRLEKLSGLAAPDYTRLS P 
AKlVLRRHLSQDHTVPGRPAASELHSRAEHTKENGLPYQSPSVPGSMKLSPQDPRPLSPGArjQLAGEKSS
EKGLRERAYGSSGELITSLPISIPLSTVQPNKLPVSIPLASWLPSRAER^STPSPVLQPRDPSSTIiEK 
QlGANAHGAGSRSLALAPAGFSYAGSVAISGAIiAGSPASLTPGAEPATLDESSSSGSLFATVGSRSSTPQ 
HPLLLAQPRNSLPASPAHQLSSSPRLGGAAQGPLPEASKGDLPSDSGFSDPESEAKRRIVFTITTGAGSA 
5 KQSPSSKHSPLTASARGDCVPSHGQDSRRRGRRKRASAGTPSLSAGVSPKRRALPSVAGLFTQPSGSPLN 
LNSMVSNINQPLE ITAI S S PETSLKS S P VP YQDHDQPPVLKKERPLSQTNGAHYSPLTSDEEPGSEDEP S 
SARI ERKIAT ISLESKS PPKTLENGGGLAGRKPAPAGE PVNS S KWKS TFS PISD IGLAKSADS PLQAS S A 
LSQNSLFTFRPALEEPSADAKLAAHPRKGFPGSLSGADGLSPGTNPANGCTFGGGLAADLSLHSFSDGAS 
LPHKGPEAAGLSSPLSFPSQRGKEGSDANPFLSKRQLDGLAGLKGEGSRGKEAGEGGLPLCGPTDKTPLL 
10 SGKAAKARDREVDLKNGHNLFISAAAVPPGS 

GPMSLQANLGSVAGSSVLQSLFSSVPAAAGLVHVSSAATRLTNSI^ 

AHPFGARVGRGAACGSATLGPSPLQAAASASASSFQAPASVETRPPPPPPPPPPPIiPPPAHIiGRSPAGPP 
VLHAPPPPNAALPPPPTLLASNPEPALLQSLASLPPNQAFL^^ 

SFTVHHQPLPRLALAQAAPGIPQASATGPSAVWVSLGMPPPYAAHLSGVKPR

15 

SEQ ID NO:320

Novel (maps to chromosome 4) 
Novel (Chromosome 4) > 

GGAATAAAAGAGTGGAAATGGGGATTTCCAGGTGCTCCCCTGGTTCATCTAGGCACCAGAGAGCTGCACTAGCAG 
20 GTCTATCATGAATCTCCTTGGAATGCTCATTTTTAGTCCTACTTGATGTGTCTGTTTCTGGAAATGCAGTATTTT 
TAATGTATCTCAACAAAATATTTTATGATTAGTAAGCTTATTCTTATATAAAGGACAATTTTTTTCCTTTTTCAC 
AGGTTCTAATAATTTTTTATTTAATAATTAGATCTATTAGATTTTATTCATAACTGTGGTAGTTGAAGTACCTTC 
TAAGCTGAGTTCAGATTTGAGAATAAACCTTGGGGTATCATTACAGAAAATTTTGTCTCAATCTGCTTTGTATTT 
GAAAGATATGAGATTCTTGAATTATATATCTTACAGACTAGTCCCCAAAAGAATACGTGTTTCCTTACCTTTAAT 
25 TTCTCATGGTAGTTAGTCTGTGAAT 

SEQ ID NO:321

Novel peroxidasin-like/ melanoma antigen 
>GL2-86-4M13F Direction: sense 

30 CGACCTGGCCAGCCACCGCGGCCTGCTGCGGCAGGGCATCGTGCAGCGGTCCGGGAAGCCGCTGCTCCCCTTCGC 
CACCGGGCCGCCCACGGAGTGCATGCGGGACGAGAACGAGAGCCCCATCCCCTGCTTCCTGGCCGGGGACCACCG 
CGCCAACGAGCAGCTGGGCCTGACCAGCATGCACACGCTGTGGTTCCGCGAGCACAACCGCATTGCCACGGAGCT 
GCTCAAGCTGAACCCGCACTGGGACGGCGACACCATCTACTATGAGACCAGGAAGATCGTGGGTGCGGAGATCCA 
GCACATCACCTACCAGCACTGGCTCCCGAAGATCCTGGGGGAGGTGGGCATGAGGACGCTGGGAGAGTACCACGG 

35 CTACGACCCCGGCATCAATGCTGGCATCTTCAACGCCTTCGCCACCGCGGCCTTCAGGTTTGGCCACACGCTTGT 
CAACCCACTGCTTTACCGGCTGGACGAGAACTTCCAGCCCATTGCACAAGATCACCTCCCCCTTCACAAAGCTTT 
CTTCTCTCCCTTCCGGATTG 

SEQ ID NO:322

>GL2_93_2B08J33F1 Direction: sense

CGACCTGGCCAGCCACCGCGGCCTGCTGCGGCAGGGCATCGTGCAGCGGTCCGGGAAGCCGCTGCTCCCCTTCGC 
CACCGGGCCGCCCACGGAGTGCATGCGGGACGAGAACGAGAGCCCCATCCCCTGCTTCCTGGCCGGGGACCACCG 
CGCCAACGAGCAGCTGGGCCTGACCAGCATGCACACGCTGTGGTTCCGCGAGCACAACCGCATTGCCACGGAGCT 
GCTCAAGCTGAACCCGCACTGGGACGGCGACACCATCTACTATGAGACCAGGAAGATCGTGGGTGCGGAGATCCA 

45 GCACATCACCTACCAGCACTGGCTCCCGAAGATCCTGGGGGAGGTGGGCATGAGGACGCTGGGAGAGTACCACGG 
CTACGACCCCGGCATCAATGCTGGCATCTTCAACGCCTTCGCCACCGC 

SEQ ID NO:323

Novel peroxidasin-like/ melanoma antigen> 

50 CGACCTGGCCAGCCACCGCGGCCTGCTGCGGCAGGGCATCGTGCAGCGGTCCGGGAAGCCGCTGCTCCCCTTCGC 
CACCGGGCCGCCCACGGAGTGCATGCGGGACGAGAACGAGAGCCCCATCCCCTGCTTCCTGGCCGGGGACCACCG 
CGCCAACGAGCAGCTGGGCCTGACCAGCATGCACACGCTGTGGTTCCGCGAGCACAACCGCATTGCCACGGAGCT 
GCTCAAGCTGAACCCGCACTGGGACGGCGACACCATCTACTATGAGACCAGGAAGATCGTGGGTGCGGAGATCCA 
GCACATCACCTACCAGCACTGGCTCCCGAAGATCCTGGGGGAGGTGGGCATGAGGACGCTGGGAGAGTACCACGG 

CTACGACCCCGGCATCAATGCTGGCATCTTCAACGCCTTCGCCACCGCGGCCTTCAGGTTTGGCCACACGCTTGT
CAACCCACTGCTTTACCGGCTGGACGAGAACTTCCAGCCCATTGCACAAGATCACCTCCCCCTTCACAAAGCTTT 
CTTCTCTCCCTTCCGGATTGTGAATGAGGGCGGCATCGATCCGCTTCTCAGGGGGCTGTTCGGGGTGGCGGGGAA 
AATGCGTGTGCCCTCGCAGCTGCTGAACACGGAGCTCACGGAGCGGCTGTTCTCCATGGCACACACGGTGGCTCT 
GGACCTGGCGGCCATCAACATCCAGCGGGGCCGGGACCACGGGATCCCACCCTACCACGACTACAGGGTCTACTG 

60 CAATCTATCGGCGGCACACACGTTCGAGGACCTGAAAAATGAGATTAAAAACCCTGAGATCCGG 
SEQ ID NO:324

>gi|20533845|ref|XM_056455.2| Homo sapiens Melanoma associated gene
(D2S448), mRNA

5 AGCCGGCCGTGGTGGCTCCGTGCGTCCGAGCGTCCGTCCGCGCCGTCGGCCATGGCCAAGCGCTCCAGGG 
GCCCCGGGCGCCGCTGCCTGTTGGCGCTCGTGCTGTTCTGCGCCTGGGGGACGCTGGCCGTGGTGGCCCA 
GAAGCCGGGCGCAGGGTGTCCGAGCCGCTGCCTGTGCTTCCGCACCACCGTGCGCTGCATGCATCTGCTG 
CTGGAGGCCGTGCCCGCCGTGGCGCCGCAGACCTCCATCCTAGATCTTCGCTTTAACAGAATCAGAGAGA 
TCCAACCTGGGGCATTCAGGCGGCTGAGGAACTTGAACACATTGCTTCTCAATAATAATCAGATCAAGAG 

10 GATACCTAGTGGAGCATTTGAAGACTTGGAAAATTTAAAATATCTCTATCTGTACAAGAATGAGATCCAG 
TCAATTGACAGGCAAGCATTTAAGGGACTTGCCTCTCTAGAGCAACTATACCTGCACTTTAATCAGATAG 
AAACTTTGGACCCAGATTCGTTCCAGCATCTCCCGAAGCTCGAGAGGCTATTTTTGCATAACAACCGGAT 
TACACATTTAGTTCCAGGGACATTTAATCACTTGGAATCTATGAAGAGATTGCGACTGGACTCAAACACA 
CTTCACTGCGACTGTGAAATCCTGTGGTTGGCGGATTTGCTGAAAACCTACGCGGAGTCGGGGAACGCGC 

15 AGGCAGCGGCCATCTGTGAATATCCCAGACGCATCCAGGGACGCTCAGTGGCAACCATCACCCCGGAAGA 
GCTGAACTGTGAAAGGCCCCGGATCACCTCCGAGCCCCAGGACGCAGATGTGACCTCGGGGAACACCGTG 
TACTTCACCTGCAGAGCCGAAGGCAACCCCAAGCCTGAGATCATCTGGCTGCGAAACAATAATGAGCTGA 
GCATGAAGACAGATTCCCGCCTAAACTTGCTGGACGATGGGACCCTGATGATCCAGAACACACAGGAGAC 
AGACCAGGGTATCTACCAGTGCATGGCAAAGAACGTGGCCGGAGAGGTGAAGACGCAAGAGGTGACCCTC 

20 AGGTACTTCGGGTCTCCAGCTCGACCCACTTTTGTAATCCAGCCACAGAATACAGAGGTGCTGGTTGGGG 
AGAGCGTCACGCTGGAGTGCAGCGCCACAGGCCACCCCCCGCCGCGGATCTCCTGGACGAGAGGTGACCG 
CACACCCTTGCCAGTTGACCCGCGGGTGAACATCACGCCTTCTGGCGGGCTTTACATACAGAACGTCGTA 
CAGGGGGACAGCGGAGAGTATGCGTGCTCTGCGACCAACAACATTGACAGCGTCCATGCCACCGCTTTCA 
TCATCGTCCAGGCTCTTCCTCAGTTCACTGTGACGCCTCAGGACAGAGTCGTTATTGAGGGCCAGACCGT 

25 GGATTTCCAGTGTGAAGCCAAGGGCAACCCGCCGCCCGTCATCGCCTGGACCAAGGGAGGGAGCCAGCTC 
TCCGTGGACCGGCGGCACCTGGTCCTGTCATCGGGAACACTTAGAATCTCTGGTGTTGCCCTCCACGACC 
AGGGCCAGTACGAATGCCAGGCTGTCAACATCATCGGCTCCCAGAAGGTCGTGGCCCACCTGACTGTGCA 
GCCCAGAGTCACCCCAGTGTTTGCCAGCATTCCCAGCGACACAACAGTGGAGGTGGGCGCCAATGTGCAG 
CTCCCGTGCAGCTCCCAGGGCGAGCCCGAGCCAGCCATCACCTGGAACAAGGATGGGGTTCAGGTGACAG 

30 AAAGTGGAAAATTTCACATCAGCCCTGAAGGATTCTTGACCATCAATGACGTTGGCCCTGCAGACGCAGG 
TCGCTATGAGTGTGTGGCCCGGAACACCATTGGGTCGGCCTCGGTGAGCATGGTGCTCAGTGTGAATGTT 
CCTGACGTCAGTCGAAATGGAGATCCGTTTGTAGCTACCTCCATCGTGGAAGCGATTGCGACTGTTGACA 
GAGCTATAAACTCAACCCGAACACATTTGTTTGACAGCCGTCCTCGTTCTCCAAATGATTTGCTGGCCTT 
GTTCCGGTATCCGAGGGATCCTTACACAGTTGAACAGGCACGGGCGGGAGAAATCTTTGAACGGACATTG 

35 CAGCTCATTCAGGAGCATGTACAGCATGGCTTGATGGTCGACCTCAACGGAACAAGTTACCACTACAACG 
ACCTGGTGTCTCCACAGTACCTGAACCTCATCGCAAACCTGTCGGGCTGTACCGCCCACCGGCGCGTGAA 
CAACTGCTCGGACATGTGCTTCCACCAGAAGTACCGGACGCACGACGGCACCTGTAACAACCTGCAGCAC 
CCCATGTGGGGCGCCTCGCTGACCGCCTTCGAGCGCCTGCTGAAATCCGTGTACGAGAATGGCTTCAACA 
CCCCTCGGGGCATCAACCCCCACCGACTGTACAACGGGCACGCCCTTCCCATGCCGCGCCTGGTGTCCAC 

40 CACCCTGATCGGGACGGAGACCGTCACACCCGACGAGCAGTTCACCCACATGCTGATGCAGTGGGGCCAG 
TTCCTGGACCACGACCTCGACTCCACGGTGGTGGCCCTGAGCCAGGCACGCTTCTCCGACGGACAGCACT 
GCAGCAACGTGTGCAGCAACGACCCCCCCTGCTTCTCTGTCATGATCCCCCCCAATGACTCCCGGGCCAG 
GAGCGGGGCCCGCTGCATGTTCTTCGTGCGCTCCAGCCCTGTGTGCGGCAGCGGCATGACTTCGCTGCTC 
ATGAACTCCGTGTACCCGCGGGAGCAGATCAACCAGCTCACCTCCTACATAGACGCATCCAACGTGTACG 

45 GGAGCACGGAGCATGAGGCCCGCAGCATCCGCGACCTGGCCAGCCACCGCGGCCTGCTGCGGCAGGGCAT 
CGTGCAGCGGTCCGGGAAGCCGCTGCTCCCCTTCGCCACCGGGCCGCCCACGGAGTGCATGCGGGACGAG 
AACGAGAGCCCCATCCCCTGCTTCCTGGCCGGGGACCACCGCGCCAACGAGCAGCTGGGCCTGACCAGCA 
TGCACACGCTGTGGTTCCGCGAGCACAACCGCATTGCCACGGAGCTGCTCAAGCTGAACCCGCACTGGGA 
CGGCGACACCATCTACTATGAGACCAGGAAGATCGTGGGTGCGGAGATCCAGCACATCACCTACCAGCAC 

50 TGGCTCCCGAAGATCCTGGGGGAGGTGGGCATGAGGACGCTGGGAGAGTACCACGGCTACGACCCCGGCA 
TCAATGCTGGCATCTTCAACGCCTTCGCCACCGCGGCCTTCAGGTTTGGCCACACGCTTGTCAACCCACT 
GCTTTACCGGCTGGACGAGAACTTCCAGCCCATTGCACAAGATCACCTCCCCCTTCACAAAGCTTTCTTC 
TCTCCCTTCCGGATTGTGAATGAGGGCGGCATCGATCCGCTTCTCAGGGGGCTGTTCGGGGTGGCGGGGA 
AAATGCGTGTGCCCTCGCAGCTGCTGAACACGGAGCTCACGGAGCGGCTGTTCTCCATGGCACACACGGT 

55 GGCTCTGGACCTGGCGGCCATCAACATCCAGCGGGGCCGGGACCACGGGATCCCACCCTACCACGACTAC 
AGGGTCTACTGCAATCTATCGGCGGCACACACGTTCGAGGACCTGAAAAATGAGATTAAAAACCCTGAGA 
TCCGGGAGAAACTGAAAAGGTTGTATGGCTCGACACTCAACATCGACCTGTTTCCGGCGCTCGTGGTGGA 
GGACCTGGTGCCTGGCAGCCGGCTGGGCCCCACCCTGATGTGTCTTCTCAGCACACAGTTCAAGCGCCTG 
CGAGATGGGGACAGGTTGTGGTATGAGAACCCTGGGGTGTTCTCCCCGGCCCAGCTGACTCAGATCAAGC 

60 AGACGTCGCTGGCCAGGATCCTATGCGAGAACGCGGACAA^^ 

GGTGGCGGAGTTCCCTCACGGCTACGGCAGCTGTGACGAGATCCCCAGGGTAGACCTCCGGGTGTGGCAG 
GACTGCTGTGAAGACTGTAGGACCAGGGGGCAGTTCAATGCCTTTTCCTATCATTTCCGAGGCAGACGGT
CTCTTGAGTTCAGCTACCAGGAGGACAAGCCGACCAAGAAAACAAGACCACGGAAAATACCCAGTGTTGG 
GAGACAGGGGGAACATCTCAGCAACAGCACCTCAGCCTTCAGCACACGCTCAGATGCATCTGGGACAAAT 
GACTTCAGAGAGTTTGTTCTGGAAATGCAGAAGACCATCACAGACCTCAGAACACAGATAAAGAAACTTG 
5 AATCACGGCTCAGTACCACAGAGTGCGTGGATGCCGGGGGCGAATCTCACGCCAACAACACCAAGTGGAA 
AAAAGATGCATGCACCATTTGTGAATGCAAAGACGGGCAGGTCACCTGCTTCGTGGAAGCTTGCCCCCCT 
GCCACCTGTGCTGTCCCCGTGAACATCCCAGGGGCCTGCTGTCCAGTCTGCTTACAGAAGAGGGCGGAGG 
AAAAGCCCTAGGCTCCTGGGAGGCTCCTCAGAGTTTGTCTGCTGTGCCATCGTGAGATCGGGTGGCCGAT 
GGCAGGGAGCTGCGGACTGCAGACCAGGAAACACCCAGAACTCGTGACATTTCATGACAACGTCCAGCTG 

10 GTGCTGTTACAGAAGGCAGTGCAGGAGGCTTCCAACCAGAGCATCTGCGGAGAAGGAGGCACAGCAGGTG 
CCTGAAGGGAAGCAGGCAGGAGTCCTAGCTTCACGTTAGACTTCTCAGGTTTTTATTTAATTCTTTTAAA 
ATGAAAAATTGGTGCTACTATTAAATTGCACAGTTGAATCATTTAGGCGCCTAAATTGATTTTGCCTCCC 
AACACCATTTCTTTTTAAATAAAGCAGGATACCTCTATATGTCAGCCTTGCCTTGTTCAGATGCCAGGAG 
CCGGCAGACCTGTCACCCGCAGGTGGGGTGAGTCTTGGAGCTGCCAGAGGGGCTCACCGAAATCGGGGTT 

15 CCATCACAAGCTATGTTTAAAAAGAAAATTGGTGTTTGGCAAACGGAACAGAACCTTTGATGAGAGCGTT 
CACAGGGACACTGTCTGGGGGTGCAGTGCAAGCCCCCGGCCTCTTCCCTGGGAACCTCTGAACTCCTCCT 
TCCTCTGGGCTCTCTGTAACATTTCACCACACGTCAGCATCTAATCCCAAGACAAACATTCCCGCTGCTC 
GAAGCAGCTGTATAGCCTGTGACTCTCCGTGTGTCAGCTCCTTCCACACCTGATTAGAACATTCATAAGC 
CACATTTAGAAACAGGTTTGCTTTCAGCTGTCACTTGCACACATACTGCCTAGTTGTGAACCAAATGTGA 

20 AAAAACCTCCTTCATCCCATTGTGTATCTGATACCTGCCGAGGGCCAAGGGTGTGTGTTGACAACGCCGC 
TCCCAGCCGGCCCTGGTTGCGTCCACGTCCTGAACAAGAGCCGCTTCCGGATGGCTCTTCCCAAGGGAGG 
AGGAGCTCAAGTGTCGGGAACTGTCTAACTTCAGGTTGTGTGAGTGCGTTAAAAAAAAAAAAAAAAAAGA 
ATCCCTATACCTCATTTGTATTTTTAAAATGCGTGATGTTTTATGAAATTGTGTCCATTTTTTAGGTATT 
AGATATGGCAGAAAAACCATrTCCACTATGCAAAGTTCTTTTAGACGTCAGTGAAAATCAACTCTCATAC 

25 CTCATGGTCTCTCTTTAATTGACCAAAACCTTCCATTTTTCTCTAAATACAAAGCGATCTGTGTTCTGAG 
CAACCTTTCCCCGAACACACAGCTTCAGTGCAGCACGCTGACCTGAGTATCCACCATGTGCCAGGCACAG 
TGCTGGGCACACGAGGCACCAAGGTCCGGGCCACCTGCCCGCAGCAAGGCCCAGCTGAGGTGGTGGAGGG 
AGCCCCTGAGGTCAGGGGCCGTTTCGGTTCAGGGTGGCAGGTGTCCAGCACTGGGGTATGGCGTCGAGGC 
TTCCATGGGGTGGGGGAGGCCAGCTTCCTTCTGACAGGATGGGCGCATACAGTGCCTGGTGTGATTTGTG 

30 CACAACCCGTGTTCCAGGTGCACATCCTCCCAAGGAGACACCCAGACCCTTCCAGCACGGGCCGGCCAAG 
TTGCTGCGGCGGAGGCAGCATTTCAGCTGTGAGGAAGGTCATTGGATTCATGTGTTTTATCTGTAAAAAT 
GGTTGTCTTAACTTCTTAACCTCATATTGGTAAGTGATTGATAAAAATTGGTTGGTGTTTCATGACATGT 
GGACTTCTTTTGAAATAGCAAGTCAAATGTAGTGACCAAATTGTGGAAGAGATTTCTGTCAAATAGGAAA 
TGTGTAAGTTCGTCTAAAAGCTGATGGTTATGTAAGTTGCTCAGGCACTCAGATGACAGCAGATTCTGGG 

35 TTCTGGGAGTGTTCTGTGCCTCTTACATGCCCTGGAGGCCTCATGGTCTCAGTGCTGAGGCGGCACACCT 
GTAGCACACCTGCGTAATGTGCGGTCTGGGCCAGTCACAAGGAATTGTGTTGTCTAAGCCAAAGGGGGAA 
GCTGACTGTGATTTACCAAAAAAAATTCTGTAATTCAAACCAAAATGTCTGCGGAATCACC^ 
CTCTCTGTAATCAGAACAGTGGGCAGTGCCTGGGTGAACGTGTCTAGCAGCCACTGTGCGGGATCGCTGT 
AACAGGAGTGGAATGTACATATTTATTTACTTTTCTAACTGCTCCAACAGCCAAATGCCTTTTTTATGAC 

40 CATTGTATTCAGTTCATTACCAAAGAAATGTTTGCACTTTGTAATGATGCCTTTCAGTTCAAATAAATGG 
GTCACATTTTCAAATGG 

SEQ ID NO:325

>gi|20533846|ref|XP_056455.2| similar to Peroxidasin [Homo sapiens]
45 MAKRSRGPGRRCLLALVLFCAWGTLAVVAQ 

FNRIREIQPGAFRRLRNLNTLIjLNKNQIKRI psgafedlenlkylylykneiqs idrqafkglasleqly 
LHFNQIETLDPDSFQHLPKLERLFLHimRITHL^^ 

AESGNAQAAAICEYPRRIQGRSVATITPEELNCERPRITSEPQDADVTSGNTVYFTCRAEGNPKPEIIWL 
ROTJNELSMKTDSRLNLLDDGTLMIQNTQETDQGIYQCMAKNVAGEVKTQEVTIjRYFG 
50 TEVLVGESVTIiECSATGHPPPRISWTRGDRTPLPVD^ 

VHATAFIIVQALPQFTWPQDRWIEGQTVDFQCEAK^^ 

GVALHDQGQYECQAWI I GS QKWAHLTVQPRVTP VFAS I PSDTTVEVGANVQLPCS SQGEPEPAITWNK 
DGVQVTESGKFHI S PEGFLTTND VGPADAGRYECVARNTIGSAS VSMVLS VNVPDVSRNGDPFVATS IVE 
AIATVDRAINSTRTHLFDSRPRSPNDLLALFRYPRDPOT^ 
55 TSYHYNDLVSPQYLNIiIANLSGCTAHRRVIOT 

YENGFNTPRG INPHRL YNGHALPMPRLVS TTLI GTETVTP^EQFTHMLMQWGQFLDHDIjDSTVVALSQAR 

FSDGQHCSNVCSNDPPCFSVMIPPITOSRARSGARCMFFVRSSPV^^ 

DASNVYGSTEHEARSIRDLASHRGLLRQGIVQRSGKPLL^ 

QLGLTSMHTLWFREHNRIATELIiKLNPHWDGDTIYYETRKIVGAEIQHITYQH^ 
60 HGYDPGINAGIFNAFATAAFRFGHTLWPLLYRLDENFQPIAQDHLPLHKAFFSPFRIVlTOGGIDPLIiRG 
LFGVAGKMRVPSQLLNTELTERLFSMAHTVAI^ 
E IKNPE IREKLKRLYGSTLNIDLFPAIAttTCDI.VPG

QLTQIKQTSLARILCDNADNITRVQSDVFRVAEFPHGYGSCDEIPRVDLRVWQDCCEDCRTRGQFNAFSY 
HFRGRRSIiEFSYQEDKPTKKTRPRKIPSVGRQGEHLSNSTSAFSTRSDASGTiroFREFVLEMQKTITDLR 
TQIKKLESRLSTTECVDAGGESHANNTi^ 
5 LQKRAEEKP 

SEQ ID NO:326

Novel WD40/SOCS box protein
>GL2_81_2_M13F Direction: anti-sense

10 GGCTTGTACCTTGCCACGGTGGCAGATGACAGACTCCTCAGGATCTGGGCCCTGGAACTGAA7^ACTCCCAGTTGC 
ATATTGCATCCTATNACCAATGGGCTTTGCTGCACATTTTTTCCACATGGTGGAGTCATTGCCACAGGGACAAGA 
GATGGCCACNTCCAGTTCTGGACAGCTCCTAGGGTCCTGTCCTCACTGAAGGCACTTATGCCGGAAAGCCCTTCG 
AAGTTTCCTAACAACTTACCAAGTCCTAGCACTGCCTATCCCCAAGAAAATGAAAGAGTTCCTCACATACAGGAC 
TTTTGTTAAGCAACACCAGCATCTTGTGCTTCTTTGGTGAGGAGGTAAATCGTCCTGTCAAGGGAGTTGCTGGAA 

15 TAATGGGCCAAACATCTGGTCTTGCATTGAAATAGCACTTTCTTTGGGATTGTGAATAGAATGTAGCAAAACCAG 
ATTCCAGTGTACTAGTCATGGATCTTTCTCTCCCTGGCATGTGAAAGTCAGCTTAGAGGAAGAGATCCACTTGCA 
CCGGCACAGAGCCGCGGAGAACCAGCGACACTGGGAGCACGGTGGCAACGGCCCTGGCGGCTCTCCTGTACAGCT 
CGTCANGCCGAGAGAAGGCCAAAATNCGGGAAACCTGATGGAATCGCCCTTAGTGAGGGTATCCAGAACTGGCAA 
CATGCCANAAACGGTACCGGGTAAACTGTTCCCCCCCATCCCCAAAAAACCNACAAAGGACCGGCCCNCACATAA 

20 CAACACACAAAAGCGCACANCCATCACAGA 

SEQ ID NO:327

>GL2_81_3_M13R Direction: anti-sense

GGCTTGTACCTTGCCACGGTGGCAGATGACAGACTCCTCAGGATCTGGGCCCTGGAACTGAAAACTCCCATTGCA 
25 TTTGCTCCTATGACCAATGGGCTTTGCTGCACATTTTTTCCACATGGTGGAGTCATTGCCACAGGGACAAGAGAT 
GGCCACGTCCAGTTCTGGACAGCTCCTAGGGTCCTGTCCTCACTGAAGCACTTATGCCGGAAAGCCCTTCGAAGT 
TTCCTAACAACTTACCAAGTCCTAGCACTGCCAATCCCCAAGAAAATGAAAGAGTTCCTCACATACAGGACTTTT 
TTAAGCAACACCACATCTTGTGCTTCTTTGTAG 

SEQ ID NO:328

>GL2_81_2_M13R Direction: sense

CGGNTCTGTTGCCGTGCAAGTGGAATCATCTTACCTCTAAGACATGACTTTCACATGCCAGGGGAGAGAAAGATC 
CATGACTAGTACACTGGCAATCTGGTTTTGCTACATTCTATTCACAATCCCNAAGAAATGCTATTTCAATGCAAG 
ACCAGATGTTTGGCCCATTATTCCAGCAACTCCCTTTGACAGGACGATTTACCCTGCTACAACAGAAGCACACAG 

35 ATGTGGTGTTGCTTAAAAAAGTCCTGTACTGTGAGGAACTCTTTCACTTTTCTTGGGGATTGGCCACGTGCTATG 
GACTTAGGTAAGTTGTTCAGGAAACTTCGCAAGGGCTTTCCGGCATAAGTGCTATCAGTGAGGACAGGACCCTAG 
TGAGCTGTCCAGAACTGGACGTGGCCATCTCTTGTCCCTGTGGCCAATGACTCCACCATGTGGAATACATGTNCA 
GCAAAGCCCATTGGTCATAAGAACAAAATGCAATCGGGAGTTNCCAGGTCCAGGGGCCCGAGATCCTGAGGGAGG 
TCNTGTGCAATCTGCCAGCGTGGCAAGGGTACCAAGCCCCCGANAAAACCCCAGAGCAACAAGGTGGGTCTCAGA 

40 ATNAGAATCAAGCGGGCCGGTCCGANACTAGCCTTCGCAGGGGTCCTGAAATACTCACCAAAGGGNCGGCAAATA 
AGGGGGGGCACATTCGTTTCAAAACCTCGCCCGAAAAAGACNCCTAGGATGAANGGCTACAAATGACCAGCGTGA 
GAAACACTGGCCNCGCAGACCCCGGGGTGAC 

SEQ ID NO:329 
Novel WD40/SOCS box protein>

CGGCTCTGTTGCCGTGCAAGTGGAATCATCTTACCTCTAAGACATGACTTTCACATGCCAGGGGAGAGAAAGATC 
CATGACTAGTACACTGGCAATCTGGTTT^ 

ACCAGATGTTTGGCCCATTATTCCAGCAACTCCCTTTGACAGGACGATTTACCCTGCTACCAAAGAAGCACAAGA 
TGTGGTGTTGCTTAAAAAAGTCCTGTATGTGAGGAACTCTTTCATTTTCTTGGGGATTGGCAGTGCTAGGACTTG 
50 GTAAGTTGTTAGGAAACTTCGAAGGGCTTTCCGGCATAAGTGCTTCAGTGAGGACAGGACCCTAGGAGCTGTCCA 
GAACTGGACGTGGCCATCTCTTGTCCCTGTGGCAATGACTCCACCATGTGGAAAAAATGTGCAGCAAAGCCCATT 
GGTCATAGGAGCAAATGCAATGGGAGTTTTCAGTTCCAGGGCCCAGATCCTGAGGAGTCTGTCATCTGCCACCGT 
GGCAAGGTACAAGCCCCCG 

SEQ ID NO:330

>gi|20149658|ref|NM_018639.2| Homo sapiens CS box-containing WD protein
(LOC55884), mRNA

CGGACGCGTGGGGCCCGGGGGGCTGAGGCGCCCGCCGCCTGCCGCGGGGGCCGCTCGCGTCCTCCATGGA 
GGCCGGAGAGGAACCGCTGCTGCTGGCCGAACTCAAGCCCGGGCGCCCCCACCAGTTTGATTGGAAGTCC 
60 AGCTGTGAAACCTGGAGCGTCGCCTTCTCCCCAGATGGCTCCTGGTTTGCTTGGTCTCAAGGACACTGCA 
TCGTCAAACTGATCCCCTGGCCGTTGGAGGAGCAGTTCATCCCTAAAGGGTTTGAAGCCAAAAGCCGAAG
TAGCAAAAATGAGACGAAAGGGCGGGGCAGCCCAAAAGAGAAGACGCTGGACTGTGGTCAGATTGTCTGG 
GGGCTGGCCTTCAGCCCGTGGCCTTCCCCACCCAGCAGGAAGCTCTGGGCACGCCACCACCCCCAAGTGC 
CCGATGTCTCTTGCCTGGTTCTTGCTACGGGACTCAACGATGGGCAGATCAAGATCTGGGAGGTGCAGAC 
5 AGGGCTCCTGCTTTTGAATCTTTCCGGCCACCAAGATGTCGTGAGAGATCTGAGCTTCACACCCAGTGGC 
AGTTTGATTTTGGTCTCCGCGTCACGGGATAAGACTCTTCGCATCTGGGACCTGAATAAACACGGTAAAC 
AGATTCAAGTGTTATCGGGCCACCTGCAGTGGGTTTACTGCTGTTCCATCTCCCCAGACTGCAGCATGCT 
GTGCTCTGCAGCTGGAGAGAAGTCGGTCTTTCTATGGAGCATGAGGTCCTACACGTTAATTCGGAAGCTA 
GAGGGCCATCAAAGCAGTGTTGTCTCTTGTGACTTCTCCCCCGACTCTGCCCTGCTTGTCACGGCTTCTT 

ACGATACCAATGTGATTATGTGGGACCCCTACACCGGCGAAAGGCTGAGGTCACTCCACCACACCCAGGT
TGACCCCGCCATGGATGACAGTGACGTCCACATTAGCTCACTGAGATCTGTGTGCTTCTCTCCAGAAGGC 
TTGTACCTTGCCACGGTGGCAGATGACAGACTCCTCAGGATCTGGGCCCTGGAACTGAAAACTCCCATTG 
CATTTGCTCCTATGACCAATGGGCTTTGCTGCACATTTTTTCCACATGGTGGAGTCATTGCCACAGGGAC 
AAGAGATGGCCACGTCCAGTTCTGGACAGCTCCTAGGGTCCTGTCCTCACTGAAGCACTTATGCCGGAAA 

15 GCCCTTCGAAGTTTCCTAACAACTTACCAAGTCCTAGCACTGCCAATCCCCAAGAAAATGAAAGAGTTCC 
TCACATACAGGACTTTTTAAGCAACACCACATCTTGTGCTTCTTTGTAGCAGGGTAAATCGTCCTGTCAA 
AGGGAGTTGCTGGAATAATGGGCCAAACATCTGGTCTTGCATTGAAATAGCATTTCTTTGGGATTGTGAA 
TAGAATGTAGCAAAACCAGATTCCAGTGTACTAGTCATGGATCTTTCTCTCCCTGGCATGTGAAAGTCAG
TCTTAGAGGAAGAGATTCCACTTGCACGGCAACAGAGCCTTACGTTAAATTTTCAGTCCAGTTATGAACA 

20 

GCAAGTGTTGAACTCTTTCTGCTTGTTTTGATTCAAAGTGCAGTTACTGATGTTGTTTTGATTATGCAAC 
TAAGTAGGCCTCCAGAGCCTCTCTAGTGGCAGAGCAGCTCACACTCCCTCCGCTGGGAACGATGGCTTCT 
GCCTAGTACCTATCCTTGTGTTTCTGATGCAGTGGTAGCATTGGTTCAAGTTCTCTCCTGCTGTGGTCAG 
AGTTGCTTCGATGTTGGCCAAGTGCTTTTCTTCTTGGGCTCCCTTCTGACCTGCAGGACAGTTTTCCTGG 
AGCCATTTGGTATGAGGTATTAATTTAGCTTAACTAAATTACAGGGGACTCAGAGGCCGTGCTCCTGACC 
25 GATCCAGACACTATTACTGGCTTTTTTTTTTTTTTAACAATGGTGTGCATGTGCAGGAAATGACAAATTT 
GTATGTCAGATTATACAAGGATGTATTCTTAAACCGCATGACTATTCAGATGGCTACTGAGTTATCAGTG 

TTATCTAAAAACCATAGTACTTAAATTGAACAGTTGCAAAGATGTCTTAATTGTGTAAAGAATTGGTGTA 
GTCATGACTTTAGCTGATACTCTTATGTACGAGATCTGTCTCTGCTGTTTAACTTCATTGGATTAATCAG 

30 CTGGTTTCAACTCTACTGCGAAACAAAAATAGCTCCTTAAAAGTACTGTTCTCCTTCAGTGGCATGTAGT 
TATCTAATCAAGACACCTCATTCAAACAAAACCTGCCTTAGGAAAATTTAATATATTTTAAATTATTTTA 
AAAGAAATACAACATCTTATTCTTTAGCTTTCTTAATCGGTGCTTTATGGAGGCCAGTGTAACGTTACAT 
GACTCGTTGAGAAAGTTGAGGAATTTCCTCTACCACCTITGTTGCTTGAAGAAAAACATGTCTTTTCAAA 
ATGAGAGGCTTTCATTGAAGAAAAGAAT^AAAACAACAGTTAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

35 AAAAAAAAAAAAAAAAAAAA 

SEQ ID NO:331

>gi|8923881|ref|NP_061109.1| CS box-containing WD protein [Homo sapiens]
MEAGEEPLLLAELKPGRPHQFDWKSSCETWSVAFSPDGSWFAWSQGHCIVKLIPWPLEEQFIPKGFEAKS
RSSKNETKGRGSPKEKTLDCGQIVWGLAFSPWPSPPSRKLWARHHPQVPDVSCLVLATGLNDGQIKIWEV
QTGLLLLNLSGHQDVVRDLSFTPSGSLILVSASRDKTLRIWDLNKHGKQIQVLSGHLQWVYCCSISPDCS
MLCSAAGEKSVFLWSMRSYTLIRKLEGHQSSVVSCDFSPDSALLVTASYDTNVIMWDPYTGERLRSLHHT
QVDPAMDDSDVHISSLRSVCFSPEGLYLATVADDRLLRIWALELKTPIAFAPMTNGLCCTFFPHGGVIAT
GTRDGHVQFWTAPRVLSSLKHLCRKALRSFLTTYQVLALPIPKKMKEFLTYRTF

45 

SEQ ID NO:332 
Numb-like

>GL3_16A_3_M13R Direction: anti-sense 

CGTCTGCCTGCCACTGGTGCGGGCGCGACGCCTCGGGCACGTAGGCTGGCTTCCTCCGCCGCAGGGCTCTTGNCC 
50 GGTAAAACCTTGTTCATGGTGCCCGCACCCGTCTGGCTCCGTCCTGCAGGGTTTCCTGGGGGGCCCCGGGGCCCC 
CACAAGGGGGACTGGGGGGCAGGGTGCCGGTTCAAGGGCCCCCCTTCCTTTGTTACAGGCTTCGGTCCAATGCNG 
AGAAGGAAAGGGGCGAAATTCNCTGCAGAATA 

SEQ ID NO:333

>GL3_16A_4_M13R Direction: anti-sense

CGTCTGCCTGCCACTGGTGCGGGCGCGACGCCTCGGGCACGTAGGCTGGCTTCCTCCGCCGCAGGCTCTGCCGTA 
ACTTGTTCATGGTTGCCCGCCCCGTCTGGCTCCGTCCTGCAGGTTTCTGGAGGCCACCGTTGCCATCGGGCCTAT 
GGGCGGGCCTTCCTTCCCTTTGTTACCAGGCCTTCCGGTTCCCCAATAGCCCGGGAAGGAAGAAAACGGGGGCGG 
AAATTTCCTTGGCCAGGAGTTATTCCCNATTTCAAACAAACTTGGGCCGGGGGACCCGGAGTTTCCGGAAAGCNC 

60 ATATTTGNCAAATTTCATAAGGAGAAGGGGGGGGGCCCCCACAAAATTTTTCGGCAGCCCCCTTTAATTTANGGT 
TTGAGAAAGGATTTCCGGGTAATTTTTACACCAAAATTTTTCAACTTTGTGGGGCGCCCAGGATACAGAGGATAT
TTAAAACACATVAAAACGGCGGGGTANCAGGGGl^TGGGGNCCTTTGGGGGGGAGAGAAGAAA^CCCGCCCCCCT 
TTTTGTGGGCCGNGGNTTTTATAACGCCCCCCCCGANAAAATCCTTTTATAAAATTTCCATGCGAGCCGCCCGTT 
TTTTTTNGGNGTGAGAGAGGGACAGGAGCACAAATATTTTTCCCCCCCGCCCTNTTTATTTTTTTACCGAGGGCG 
5 CCCCAAANAGGCGTTATTGGGGACGCGGCCGGTTTACAAAATAATAATGACGCTGCGAGAAAAAATAGAGAAAAG 
GGTGGCCCCCCCCCTGCGGAGAANAACCCCGCGGNGGNAGATATTACTTGGCAGGGAGCGCCCCTTTTTAT 

SEQ ID NO:334

>GL3_39L_1_M13R Direction: anti-sense

10 CGTCTGCCTGCCACTGGTGCGGGCGCGACGCCTCGGGCACGTAGGCTGGCTTCCTCCGCCGCAGGCTCTGCCGTA 
ACTTGTTCATGGTGCCCGCCCCCGTCTGGCTCCGTCCTGCAGGTTTTCTGGGGGCCCCGGGGCCCCCAACAAGGG 
GGGGCCTTGGGGGGGCCAGGGTTGCCGGAGAAGCCCAAGGCAACACTTGGNGAAAGCCCAACCGGNTAGGCACAA 
CCGGGGCCACTTATNGGCGANGGAGCCATNACACATTCCCTTTTGGTTACCCAAGGGCANATCCGGATCCCCAAT 
TTGC^CCCGGANATGGAAAGCAAAAGGGGGCGCCGGAAAAAATTTCCTTTGCCCGAGGATTAATTTCCCCAAATT 

15 CCAAACACACCCTGTGGTGGCACGGGNGACCACGGAATTCCGAAAAGGCCNATTTTGGCCCATATTCTTTTAGGG 
AAAGGGGGGGCCCCCCCCAAAAGAATTTTGTTCCNGGCCCCCCATATATAAA 

SEQ ID NO:335

>GL3_16A_2_M13R Direction: anti-sense

20 CGTCTGCCTGCCACTGGTGCGTGGCGCGACGCCTCGGGCACGTAGGCTGGCTTCCTCCGCCGCAGGCTCTGCCGT 
TAAACTTGTTCATTGGTGCCCGCCCCGTTCTTGGCCTCCGTCCCTTGCAGGTTTCTGGGGGCCCCGGNGGGGCCC 
CCAAACAAGGGGGTGGCTTTGGAAGCCCCAACCGGGTTGCCAACNCGGACACTATGGGGGGACCTTTCACATTCC 
TTTTGTTACCAGGCTTCGGTTCACAATTGGCCCGAAGGAAAGAAAAGGGGGCGCGGAAATTTCCTTGTGCCAAGG 
ATTTTATTCCCCAATTTTCAAGCGACCGTTGGGCCGGGGACCCCGGACTCCGAAGCCATTTGGCAATTCTATNAG 

25 GGAAAGGGGNGCCCCCAAAATTTCGCGGACCCTATAAATAAGATTGAGCAGACTCGGGTAATTTATAACAAATTT 
TACCACAACGCGTTGTTGGGGACCCCCGGGTATACTGCAGGTTATTTATTATAACCAWAAAAGGCGGTGATATTC 
CANGTAGTTGTGGAAAACCTGTTGTGGNGGGGAAACAAAAAAAAACAACACCCCCCCCCTTTTTTGGGGTANGGC 
CGGGTGATTTTTATAACAACCCCCCCAGCCAAAAAGAAGCCGCTTTTTNTAANAAAG 

SEQ ID NO:336

>GL3_39L_3_M13F Direction: N/A

CGAGGAGGCTCGCGCGCGTGCATCCCGCACCATCCCCCAGCCCCCAGAAACCTGCAGGACGGAGCCAGACGGGGT
GGGCACCATGAACAAGTTACGGCAGAGCCTGCGGCGGAGGAAGCCAGCCTACGTGCCCGAGGGCCGTCGCGCCCG 
AACCAGTGGCAGGCAGACG 

35 

SEQ ID NO:337

>GL3_39L_3_M13R Direction: anti-sense

CGTCTGCCTGCCACTGGTGCGGGCGCGACGCCTCGGGCACGTAGGCTGGCTTCCTCCGCCGCAGGCTCTGCCGGT 
AACTTGTTCATGGTGCCCACCCCGTCTGGCTCCGTCCTGCAGGTTTCTGGGGGCTGGGGGATGGTGCGGGATGCA 
40 ACGCGCGCGAGCTTCCTCG 

SEQ ID NO:338

>GL3_16A_1_M13F Direction: anti-sense

CGTCTGCCTGCCACTGGTGCGGGCGCGACGCCTCGGGCACGTAGGCTGGCTTCCTCCGCCGCAGGCTCTGCCGTA 
45 ACTTGTTCATGGTGCCCGCCCCGTCTGGCTCCGTCCTGCAGGTTTCTGGGGGATGGTGCGGGATGCACGCGCGCG 
AGCCTCCTCG 

SEQ ID NO:339 

>GL3_16A_1_M13R Direction: sense 
50 CGAGGAGGCTCGCGCGCGTGCATCCCGNACCATCCCCCAGAAACCTGCAGGACGGAGCCAGACGGGGCGGGCACC 
ATGAACAAGTTACGGCAGAGCCTGCGGCGGAGGAAGCCAGCCTACGTGCCCGAGGCGTCGCGCCCGACCAGTGGC 
AGGCAGACG 

SEQ ID NO:340

>gi|10863898|ref|NM_004756.1| Homo sapiens numb homolog (Drosophila)-like
(NUMBL), mRNA

ATGTCCCGCAGCGCGGCGGCCAGCGGCGGACCCCGGAGGCCTGAGCGGCACCTGCCCCCAGCCCCCTGTG 
GGGCCCCGGGGCCCCCAGAAACCTGCAGGACGGAGCCAGACGGGGCGGGCACCATGAACAAGTTACGGCA 
GAGCCTGCGGCGGAGGAAGCCAGCCTACGTGCCCGAGGCGTCGCGCCCGCACCAGTGGCAGGCAGACGAG 
60 GACGCGGTGCGGAAGGGCACGTGCAGCTTCCCGGTCAGGTACCTGGGTCACGTGGAGGTAGAGGAGTCCC 
GGGGAATGCACGTGTGTGAAGATGCGGTGAAGAAGCTGAAGGCGATGGGCCGAAAGTCCGTGAAGTCTGT
CCTGTGGGTGTCAGCCGATGGGCTCCGAGTGGTGGACGACAAAACCAAGGATCTTCTGGTCGACCAGACC 
ATCGAAAAGGTCTCCTTTTGTGCTCCTGACCGCAACCTGGACAAGGCTTTCTCCTATATCTGTCGTGACG 
GGACTACCCGCCGCTGGATCTGCCACTGTTTTCTGGCACTGAAGGACTCCGGCGAGAGGCTGAGCCACGC 
5 TGTGGGCTGTGCTTTTGGCGCCTGCCTGGAGCGAAAACAGCGACGGGAGAAGGAATGTGGGGTCACGGCC 
GCCTTCGATGCCAGCCGCACCAGCTTCGCCCGCGAGGGCTCCTTCCGCCTGTCTGGGGGTGGGCGGCCTG 
CTGAGCGAGAGGCCCCGGACAAGAAGAAAGCAGAGGCAGCAGCTGCCCCCACTGTGGCTCCTGGCCCTGC 
CCAGCCTGGGCACGTGTCCCCGACACCAGCCACCACATCCCCTGGTGAGAAGGGTGAGGCAGGCACCCCT 
GTGGCTGCAGGCACCACTGCGGCCGCCATCCCCCGGCGCCATGCACCCCTGGAGCAGCTGGTTCGCCAGG 

10 GCTCCTTCCGTGGGTTCCCAGCACTCAGCCAGAAGAACTCGCCTTTCAAACGGCAGCTGAGCCTACGGCT 
GAATGAGCTGCCATCCACGCTGCAGCGCCGCACTGACTTCCAGGTGAAGGGCACAGTGCCTGAGATGGAG 
CCTCCTGGTGCCGGCGACAGTGACAGCATCAACGCTCTGTGCACACAGATCAGTTCATCTTTTGCCAGTG
CTGGAGCGCCAGCACCAGGGCCACCACCTGCCACAACAGGGACTTCTGCCTGGGGTGAGCCCTCCGTGCC 
CCCTGCAGCTGCCTTCCAGCCTGGGCACAAGCGGACACCTTCAGAGGCTGAGCGATGGCTGGAGGAGGTG 

15 TCACAGGTGGCCAAGGCCCAGCAGCAGCAGCAGCAGCAACAGCAACAGCAGCAGCAGCAGCAGCAGCAAC 
AGCAGCAAGCAGCCTCAGTGGCCCCAGTGCCCACCATGCCTCCTGCCCTGCAGCCTTTCCCCGCCCCCGT 
GGGGCCCTTTGACGCTGCACCTGCCCAAGTGGCCGTGTTCCTGCCACCCCCACACATGCAGCCCCCTTTT 
GTGCCCGCCTACCCGGGCTTGGGCTACCCACCGATGCCCCGGGTGCCCGTGGTGGGCATCACACCCTCAC 
AGATGGTGGCAAACGCCTTCTGCTCAGCCGCCCAGCTCCAGCCTCAGCCTGCCACTCTGCTTGGGAAAGC 

20 TGGGGCCTTCCCGCCCCCTGCCATACCCAGTGCCCCTGGGAGCCAGGCCCGCCCTCGCCCCAATGGGGCC 
CCCTGGCCCCCTGAGCCAGCGCCTGCCCCAGCTCCAGAGTTGGACCCCTTTGAGGCCCAGTGGGCGGCAT 
TAGAAGGCAAAGCCACTGTAGAGAAACCCTCCAACCCCTTTTCTGGTGACCTGCAAAAGACATTCGAGAT 
TGAACTGTAG 

SEQ ID NO:341

>gi|10863899|ref|NP_004747.1| numb homolog (Drosophila)-like; numb
(Drosophila) homolog-like [Homo sapiens]

MSRSAAASGGPRRPERHLPPAPCGAPGPPETCRTEPDGAGTMNKLRQSLRRRKPAYVPEASRPHQWQADE
DAVRKGTCSFPVRYLGHVEVEESRGMHVCEDAVKKLKAMGRKSVKSVLWVSADGLRVVDDKTKDLLVDQT
IEKVSFCAPDRNLDKAFSYICRDGTTRRWICHCFLALKDSGERLSHAVGCAFGACLERKQRREKECGVTA
AFDASRTSFAREGSFRLSGGGRPAEREAPDKKKAEAAAAPTVAPGPAQPGHVSPTPATTSPGEKGEAGTP
VAAGTTAAAIPRRHAPLEQLVRQGSFRGFPALSQKNSPFKRQLSLRLNELPSTLQRRTDFQVKGTVPEME
PPGAGDSDSINALCTQISSSFASAGAPAPGPPPATTGTSAWGEPSVPPAAAFQPGHKRTPSEAERWLEEV
SQVAKAQQQQQQQQQQQQQQQQQQQQAASVAPVPTMPPALQPFPAPVGPFDAAPAQVAVFLPPPHMQPPF
VPAYPGLGYPPMPRVPVVGITPSQMVANAFCSAAQLQPQPATLLGKAGAFPPPAIPSAPGSQARPRPNGA
PWPPEPAPAPAPELDPFEAQWAALEGKATVEKPSNPFSGDLQKTFEIEL

SEQ ID NO:342 

P130Cas (Crk-associated substrate)
>GL1_43_2_1_M13F Direction: sense

CGCGGGATGTCGTACTCGTCCTGCTCCGGCTGGGCGGCCTCGTATACATAGCCCTGCCCCACGCGGGTGGGCACC 
ACCACCTTTGCCGGGGGCTTCGTGCCCTCCCAGCTGCGTGTGTCCATGGACGGGGGGACCTGGTAGATGTCATGC 
CCCATCCCGGCAGAAGGTGGCACCTGGTAAATATCCTGGGCAGGGCCTCCAGGCCCTGGGGGCACCTGGTACAGG 
TCTGTGGCCGGGCTGGGAAACGGGTGATGGGGTGTCTGCTTCGAGAAGGTGGATGTCTGCTTGGC 

45 

SEQ ID NO:343

>GL1_43_2_1_M13R Direction: anti-sense

GCCAAGCAGACATCCACCTTCTCGAAGCAGACACCCCATCACCCGTTTCCCAGCCCGGCCACAGACCTGTACCAG 
GTGCCCCCAGGGCCTGGAGGCCCTGCCCAGGATATTTACCAGGTGCCACCTTCTGCCGGGATGGGGCATGACATC 
50 TACCAGGTCCCCCCGATCCATGGACACACGCAGCTGGGAGGGCACGAAGCCCCCGGCAAAGGTGGTGGTGCCCAC 
CCGCGTGGGGCAGGGCTATGTATACGAGGCCGCCCAGCCGGAGCAGGACGAGTACGACATCCCGCG 

SEQ ID NO:344

>GL1_43_3_6_M13F Direction: sense
55 CGCGGGATGTCGTACTCGTCCTGCTCCGGCTGGGCGGCCTCGNTATACATAGCCCTGCCCCACGCGGGTGGGCAC 
CACCACCTTTGCCGGGGGCTTCGTGCCCTCCCAGCTGCGTGTGTCCATGGACGGGGGGACCTGGTAGATGTCATG 
CCCCATCCCGGCAGAAGGTGGCACCTGGTAAATATCCTGGGCAGGGCCTCCAGGCCCTGGGGGCACCTGGTACAG 
GTCTGTGGCCGGGCTGGGAAACGGGTGATGGGGTGTCTGCTTCGAGAAGGTGGATGTCTGCTTGGC 
SEQ ID NO:345

>GL1_43_3_6_M13R Direction: anti-sense

GCCAAGCAGACATCCACCTTCTCGAAGCAGACACCCCATCACCCGTTTCCCAGCCCGGCCACAGACCTGTACCAG 
GTGCCCCCAGGGGCCTGGAGGCCCTGCCCAGGATATTTACCAGGTGCCACCTTCTGCCGGGATGGGGCATGACAT 
5 CTACCAGGTCCCCCCGTCCATGGACACACGCAGCTGGGAGGGCACGAAGCCCCCGGCAAAGGTGGTGGTGCCCAC 
CCGCGTGGGGCAGGGCTATGTATACGAGGCCGCCCAGCCGGAGCAGGACGAGTACGACATCCCGCG 

SEQ ID NO:346 

>GL1_43_4_4_M13F Direction: sense 
10 CGCGGGATGTCGTACTCGTCCTGCTCCGGCTGGGCGGCCTCGTATACATAGCCCTGCCCCACGCGGGTGGGCACC 
ACCACCTTTGCCGGGGGCTTCGTGCCCTCCCAGCTGCGTGTGTCCATGGACGGGGGGACCTGGTAGATGTCATGC 
CCCATCCCGGCAGAAGGTGGCACCTGGTAAATATCCTGGGCAGGGCCTCCAGGCCCTGGGGGCACCTGGTACAGG 
TCTGTGGCCGGGCTGGGAAACGGGTGATGGGGTGTCTGCTTCGAGAAGGTGGATGTCTGCTTGGC 

SEQ ID NO:347

>GL1_43_4_4_M13R Direction: anti-sense

GCCAAGCAGACATCCACCTTCTCGAAGCAGACACCCCATCACCCGTTTCCCAGCCCGGCCACAGACCTGTACCAG 
GTGCCCCCAGGGACCTGGAGGCCCTGCCCAGGATATTTACCAGGTGCCACCTTCTGCCGGGATGGGGCATGACAT 
CTACCAGGTCCCCCCGATCCATGGACACACGCAGCTGGGAGGGCACGAAGCCCCCGGCAAAGGTGGTGGTGCCCA 
20 CCCGCGTGGGGCAGGGCTATGTATACGAGGCCGCCCAGCCGGAGCAGGACGAGTACGACATCCCGCG 

SEQ ID NO:348

>GL1_43_4_6_M13F Direction: sense 

CGCGGGATGTCGTACTCGTCCTGCTCCGGCTGGGCGGCCTCGTATACATAGCCCTGCCCCACGCGGGTGGGCACC 
25 ACCACCTTTGCCGGGGGCTTCGTGCCCTCCCAGCTGCGTGTGTCCATGGACGGGGGGACCTGGTAGATGTCATGC 
CCCATCCCGGCAGAAGGTGGCACCTGGTAAATATCCTGGGCAGGGCCTCCAGGCCCTGGGGGCACCTGGTACAGG 
TCTGTGGCCGGGCTGGGAAACGGGTGATGGGGTGTCTGCTTCGAGAAGGTGGATGTCTGCTTGGC 

SEQ ID NO:349
>GL1_43_4_6_M13R Direction: anti-sense

GCCAAGCAGACATCCACCTTCTCGAAGCAGACACCCCATCACCCGTTTCCCAGCCCGGCCACAGACCTGTACCAG 
GTGCCCCCAGGGNCCTGGAGGCCCTGCCCAGGATATTTACCAGGTGCCACCTTCTGCCGGGATGGGGCATGACAT 
CTACCAGGTCCCCCCGATCCATGGACACACGCAGCTGGGAGGGCACGAAGCCCCCGGCAAAGGTGGTGGTGCCCA 
CCCGCGTGGGGCAGGGCTATGTATACGAGGCCGCCCAGCCGGAGCAGGACGAGTACGACATCCCGCG

35 

SEQ ID NO:350

>GL1_43_LOW_3_G3F1 Direction: sense

GCGGGATGTCGTACTCGTCCTGCTCCGGCTGGGCGGCCTCGTATACATAGCCCTGCCCCACGCGGGTGGGCACCA 
CCACCTTTGCCGGGGGCTTCGTGCCCTCCCAGCTGCGTGTGTCCATGGACGGGGGGACCTGGTAGATGTCATGCC
CCATCCCGGCAGAAGGTGGGACCTGGTAAATATCCTGGGCAGGGCCTCCAGGCCCTGGGGGCACCTGGTACAGGT
CTGTGGCCGGGCTGGGAAACGGGTGATGGGGTGTCTGCTTCGAGAAGGTGGATGTCTGCTTGGC 

SEQ ID NO:351

>gi|7656923|ref|NM_014567.1| Homo sapiens breast cancer anti-estrogen
resistance 1 (BCAR1), mRNA

GAGGCGGCAGCTGCGCGGCGGCACCGGGGCGGCTGCGGCGCGCTCGGAGCCCCGAGGCACGCGGCCCGGG 
CAGCTCGGTGTGCGCCCCCGCGAGAGCCGGGCCCCAGGCCCGCCGGACACCATGAACCACCTGAACGTGC 
TGGCCAAAGCGCTCTATGACAATGTGGCCGAGTCCCCGGATGAGCTCTCCTTCCGCAAGGGTGACATCAT 
GACGGTGCTGGAGCAGGACACGCAGGGCCTGGACGGCTGGTGGCTCTGCTCGCTGCATGGGCGCCAGGGC 

50 ATCGTGCCTGGGAACCGCCTCAAGATCTTGGTGGGCATGTATGATAAGAAGCCAGCAGGGCCTGGCTCCG 
GCCCTCCCGCCACCCCGGCCCAGCCTCAGCCTGGCCTCCATGCCCCAGCGCCTCCGGCCTCCCAGTACAC 
GCCCATGCTCCCCAACACCTACCAGCCCCAGCCAGACAGCGTCTACCTGGTGCCCACTCCCAGCAAGGCT 
CAGCAAGGCCTCTACCAAGTCCCGGGTCCCAGCCCTCAGTTCCAGTCTCCCCCAGCCAAGCAGACATCCA 
CCTTCTCGAAGCAGACACCCCATCACCCGTTTCCCAGCCCGGCCACAGACCTGTACCAGGTGCCCCCAGG 

GCCTGGAGGCCCTGCCCAGGATATTTACCAGGTGCCACCTTCTGCCGGGATGGGGCATGACATCTACCAG

GTCCCCCCGTCCATGGACACACGCAGCTGGGAGGGCACGAAGCCCCCGGCAAAGGTGGTGGTGCCCACCC 
GCGTGGGGCAGGGCTATGTATACGAGGCCGCCCAGCCGGAGCAGGACGAGTACGACATCCCGCGACACCT 
GCTGGCCCCGGGGCCACAGGACATCTATGATGTGCCCCCGGTTCGGGGGCTGCTTCCCAGCCAGTATGGC 
CAGGAGGTGTATGACACACCCCCCATGGCTGTCAAGGGTCCCAATGGCCGAGACCCGTTGCTGGAGGTGT 

60 ATGACGTGCCCCCCAGTGTGGAGAAGGGCCTGCCACCGTCCAACCACCACGCAGTCTACGACGTTCCTCC 
ATCGGTGAGCAAGGATGTGCCCGATGGCCCACTGCTGCGTGAGGAGACCTACGATGTGCCCGCCGCCTTC
GCCAAGGCCAAGCCCTTTGACCCGGCCCGCACCCCACTGGTACTGGCTGCGCCCCCTCCAGACTCCCCGC 
CGGCCGAGGACGTGTATGACGTGCCGCCCCCGGCTCCTGACCTCTACGACGTGCCCCCTGGCTTGCGGCG 
GCCTGGCCCGGGCACCCTGTACGATGTGCCCCGTGAACGGGTGCTTCCTCCTGAGGTGGCTGATGGTGGC 
5 GTGGTCGACAGTGGTGTGTATGCGGTGCCTCCCCCAGCTGAACGTGAAGCCCCGGCAGAGGGCAAGCGCC 
TGTCGGCCTCCAGCACCGGCAGCACACGCAGCAGCCAGTCTGCGTCCTCCTTGGAGGTGGCAGGGCCGGG 
CCGGGAACCCCTGGAGCTGGAAGTTGCTGTGGAGGCCCTGGCACGGCTGCAGCAGGGTGTGAGCGCCACC 
GTTGCCCACCTTCTGGACCTGGCAGGCAGCGCCGGTGCGACTGGGAGCTGGCGTAGCCCCTCTGAGCCAC 
AGGAGCCGCTGGTGCAGGACCTGCAGGCTGCTGTGGCCGCTGTCCAGAGTGCCGTCCACGAGCTGTTGGA 

10 GTTTGCCCGCAGCGCGGTGGGCAATGCTGCCCACACATCTGACCGTGCCCTGCATGCCAAGCTTAGCCGG 
CAGCTGCAGAAGATGGAGGACGTGCACCAGACGCTGGTGGCACATGGTCAGGCCCTCGACGCTGGCCGGG 
GAGGCTCTGGAGCCACCCTTGAGGACCTGGACCGGCTGGTGGCCTGCTCGCGGGCTGTGCCCGAGGACGC 
CAAGCAGCTGGCCTCCTTCCTGCACGGCAATGCCTCACTGCTCTTCAGACGGACCAAGGCCACTGCCCCG 
GGGCCTGAGGGGGGTGGCACCCTGCACCCCAACCCCACTGACAAGACCAGCAGCATCCAGTCACGACCCC 

15 TGCCCTCACCCCCTAAGTTCACCTCCCAGGACTCACCAGATGGGCAGTACGAGAACAGCGAGGGGGGCTG 
GATGGAGGACTATGACTACGTCCACCTACAGGGGAAGGAGGAGTTTGAGAAGACCCAGAAGGAGCTGCTG 
GAAAAGGGCAGCATCACGCGGCAGGGCAAGAGCCAGCTGGAGTTGCAGCAGCTGAAGCAGTTTGAACGAC 
TGGAACAGGAGGTGTCACGGCCCATAGACCACGACCTGGCCAACTGGACGCCAGCCCAACCCCTGGCCCC 
GGGGCGAACAGGCGGCCTGGGGCCCTCGGACCGGCAGCTGCTGCTCTTCTACCTGGAGCAGTGTGAGGCC 

20 AACCTGACCACACTGACCAACGCCGTGGACGCCTTCTTTACCGCCGTGGCCACCAACCAGCCGCCCAAGA 
TCTTTGTGGCGCACAGCAAGTTCGTCATCCTCAGCGCCCACAAGCTGGTGTTCATCGGGGACACACTGTC 
ACGGCAGGCCAAGGCTGCTGACGTGCGCAGCCAGGTGACCCACTACAGCAACCTGCTGTGCGACCTCCTG 
CGCGGCATCGTGGCCACCACCAAGGCCGCTGCCTTGCAGTACCCATCGCCTTCCGCGGCCCAGGACATGG 
TGGAGAGGGTCAAGGAGCTGGGCCACAGCACCCAGCAGTTCCGCCGCGTCCTAGGCCAGCTGGCAGCCGC 

25 CTGAGGGTGGTGACCCCAGGAGGGAGGCAGGGGAGGGGTGCGGCGGTCCCAGCTCCCTGGCTCCCATGTC 
AAGAGTCGCTGTGCCACAGGCTTAGGGACAGGACCCCAGCTCTGCGTCGGTCCTGGTGCCCTGGATGCCC 
AGGAATCTGTATATATTTATGGCCGGGCAGGGTGTGGGGCCATGCCTCCTCAGGAGCCGAAGCCCAGGGG 
CCGCAGTGGCCTTCCCCAGCATGCACCACGGGCCCGGGTTGGGTCACCAGACGGGGCTGGAGTGTGAGGG 
TCCTGCAGCCTGCAGGACCTCGTGCCACCCCGAGGGCTGAGCCTGGTCCCACGAGGGTGCCGTGTCCCCT 

30 GACAGGGCCAGTGCAGTTTGGTGTGTCCTCCGCCTTACCAGGAGAAGAACCTGAAGAACTATTTTTCGTT 
ATTGGTTTTCCAATCATTTGACTAAGAGTCTCCATTTAAATAAAGTTTTTAAAAGGAA 

SEQ ID NO:352

>gi|7656924|ref|NP_055382.1| breast cancer anti-estrogen resistance 1;
Crk-associated substrate p130Cas [Homo sapiens]

MNHLNVLAKALYDNVAES PDELS FRKGDIMTVLEQDTQGLDGWWLCSLHGRQGIVPGNRLKILVGMYDKK 
PAGPGSGPPATPAQPQPGLHAPAPPASQYTPMLPNTYQPQPDSVYLVPTPSKAQQGDYQVPGPSPQPQSP 
PAKQTSTFSKQTPHHPFPSPATDLYQVPPGPGGPAQDIYQVPPSAGMGHDIYQVPPSMDTRSWEGTKPPA
KVWPTRVGQG YVYEAAQPEQDEYD I PRHLLAPGPQD I YDVPPVRGLIiPSQYGQEVYDTP PMAVKGPNGR 

DPLLEVYDVPPSVEKGLPPSNHHAVYDVPPSVSKDVPDGPIjLREETYDVPPAFAKAKPFDPART

PPPDSPPAEDVYDVPPPAPDIjYDVPPGLRRPGPGTLYDVPRERVLPPEVADGGVVBSGVYAVPPPAEREA 

paegkrlsasstgstrssqsasslevagpgreplelevaveai^lqqgvsatvahlldlagsagatgsw 
rspsepqeplvqdlqaavaavqsawellefarsavgnaahtsdralhakijsrqiiqkmedvhqtlvahgq 
aldagrggsgatledldrlvacsravpedakqiiasflhgnasllfrrtkatapgpegggtlhpnptdkts 
siqsrplpsppkftsqdspdgqyenseggwmedydyvhlqgkeefektqkellekgsitrqgksqlelqq
lkqferleqevsrpidhdlaiwtpaqplapgrtc^ 

tnqppkifvahskfvilsahklvfigdtlsrqakaadvrsqvthysl^lcdllrgivattkaaalqypsp 
saaqdmvervkelghstqqfrrvlgqlaaa 

SEQ ID NO:353
Perlecan 

>GL2_JB7_B06_G3F1 Direction: N/A 

ACCGCTCCTCTCCCGAGGGCACACAACCAACCAAACTGGTGCTCTGAAACAGCGCCAGTGTCCAGTTCTCTGAGG 
AGCACTGGGTCCATGAGGTCTTGGNCGGGCGGTGCAGCTGCGCTGGAGCTGCTGGAGTGTGCGTGGAAGANACTG 

TNGAGTGGCGTGGNTCATTCAAGAACNGGTGTTCCGGAWGAAAACCAGAACAAGGTGGGTTAGATANGGAATAAA
AGGGGGGCCGGATCGAATAAATCTTGAAGGGTCTTGAANTNAATCGTNCCAAACTTTGGAACACTTGGTCCCGGG 
TTTTCAACAGTGGTGGGCGAACATTTTTCATCTAGCAGTGGGACCCGGCGGTTAAATAATTTCTATAAATATACT 
GGGCC^GTAATAAATAGATTGGAAGNGTTCACGAGTAATTTTTAACACAACATTTCCAACCAT^ 
AGCGACCGAGAGTAATANTTAAGAAAAACGGGTTACCGTTO 

AGACCCATTTATGAGGACGCGGGTNTATNTAACCAGCCCCAAAGAANACAGGTTTAAAAAAGTTTGATGG
CAGCTTTNTTGGTCGAAGTTNTGAAGACAAAGAATTTT 
AAAAGAGGATTTGTGTNCGGACACAGGCGTTAAAAAATTAAATTACCGCGNATAAT^AGATGAAAATAGTTGGTGC
GCCGAGGNCGAAAACGCTCGGAAAGTATTCTCNGGCAGCGCC^^ 

ATAATTGTGCTAGCCGGTCGTCAAOTAGATGACGAACTAAAATATNATAANTATGGGGTTTAACACGTGTGGGTG 
AGTAAGAGAGGAATTTAAAAAACAGGGTGTGGTTTATACATAAACCATTATATTTTTATAATAGCGAATGANAGG 
TAAAAGAGACCCGGCNGGAGATAATATCTAGTGGGTACCAGATTGGTGAATAATAGGTACTGGGGTAAATTTGTT 



SEQ ID NO:354

>GL2_37_B01_G3F1 Direction: N/A 

ACCGCTCCTCTCCCGAGGGCACACACCCACCCAAACTGGTGCTCTGAACCAGCGCCAGGTCCAGTTCTCTGAGGA 
GCACTGGGTCCATGAGTCTGGNCGGCCGGTGCAGCGCGCGGAGCTGCTGCAGGTGCTGCAGAGCCTGGAGTGGCG
TGCTCATCCAGAACTGTGTTCTGCGAGAAACCAGCACAGTGGTTAGATATGATAAAGCGGGCGCTCTGACTAAAT 
CTGAGGTCTTGATACTCACTGACTGTCGTAAGGGNGGAATTCTGCGGCCGCGTAAATTCAAATTCTGCCCTATAN 
GTGAGTCCGTATTACAATTCACTTGGGGCGTCGTTTTACAACAGTCTGTGACTGGGAAAAAACCTGGCGTTACCC 
ACACATTAATCTGCCTTGCAGCGACATTCCCCTTTCTGCCAGACTGGNCGTAATAGCGAAGAGGGGCCGCGACAC 
GATCTGCCCTTCCCAACAGTTGCGCAGAGCTATACGGTACAGGGGGANGATGTTAAAGCGGNTGTTACAAACGTA
TOTAAAAGAGGTAAAGAAAAGAACCGCAGGGTGTGTAAAATNCACGGAGGCNCTTANGGTGTATTGGTGCGCGGC 
GATTTGTTTNCCCAACGGCAAGNTTGGGAATACAATTTAAATTATTGAAACAAAGNGTGACCNCGATGTGTGGTG 
TGGAANAGGGAATATAGGGGTGT 



SEQ ID NO:355

>GL2_37_B02_G3F1 Direction: sense

ACCGCCTCCTCTCCCGAGGCCACACACCCACCCAACCTGGTGCTCTGAACCAGCGCCAGGTCCAGTTCTCTGAGG 
AGCACTGGGTCCATGAGTCTGGCCGGCCGGTGCAGCGCGCGGAGCTGCTGCAGGTGCTGCAGAGCCTGGAGGCCG 
TGCTCATCCAGACCGTGT 


SEQ ID NO:356

>GL2-86-1M13F Direction: sense

CTGGTGGGGATGGGGACCGCCTGCCCAGGGGTGAGCTGCCTTTTGCTCCACAGCCGACACTAAAGACAATTCCCA 
ATCCTGAGTGGGTGGCAGAGACTCCTGCGATGCCCGTCTCAGGTAGCTGTGGGGCACCAGCCCACAAGCCGAGGT 
TGGCTCTCCTAGGAGTGAGAACTGCCCAAGGGCTGCAGAAACAGGCCACCCAGCTCTATCTGGGGGCTCCATCGG
TGGGTAGGGGGACAGTGGGGGCAGTTCTGGGCCCACCCAGCCACTGTTCCTGACCCCAAGTCCTGGTGACTTTCT 
GAGGTGCCCACTCCCATCCAACCTGCCTTGCTGGCCAGCCTTGTGGCTTTGCCCAGCTGTGTGTGTGAGGGTGGC 
ATGCCCACCTCCAGTCCAGCCCAGGGCGGTA 



SEQ ID NO:357

>GL2-86-2M13F Direction: sense 

CTGGTGGGGATGGGGACCGCCTGCCCAGGGGTGAGCTGCCTTTTGCTCCACAGCGGCACTAAAGACAATTCCCAA 
TCCTGAGTGGGTGGCAGAGACTCCTGCGATGCCCGTCTCANGTAGCTGTGK3GGCACCAGCCCACAAGCCGAGGTT 
GGNTCTCCTAGGAGTGAGAACTGCCCAAGGGCTGCAGAAACAGGCCACCCAGCTCTATCTGGGGGCTCCAATCGG 

TGGGTAGGGGGACAGTGGGGGCAGTTCTGGGCCCACCCAGCACTGTTCCTGAACCCAAGTCCTGGTGACTTTCTG
AGGTGCCCACTCCCATCCAAACTGGCTTGCTGGNCAGCCTTGTGGCTTTGNCCAGCTGTGTGTGTGAGGGTGGCA 
TGCCCANCTCCAGTCCAGCCCAGGGCGGTAGCAGCAAAGCGTGGCATCGACTCGGNNTTTCTTACAAAAAATTCA 
TAAATAATATTCACTAATAATATACTCGGAACATTTGTCGGGGCTTGGGGGCGTTGGCCCCGGGGAAGTCCAGTG 
TTGGGGGGCAAGGCCAGGTTGGCCCTANACGAAGGGGGGCCAGGGGCCGTTGTGTTTGGGCCCCGGGCCCTGGGG 

CGCNGGTGCTTGCACAGGGC



SEQ ID NO:358 

>GL2-86-3M13F Direction: sense 

CTGGTGGGGATGGGGACCGCCTGCCCAGGGGTGAGCTGCCTTTTGCTCCACAGCCGGCACTAAAGACAATTCCCA 
ATCCTGAGTGGGTGGCAGAGACTCCTGCGATGCCCGTCTCAGGTAGCTGTGGGGCACCAGCCCACAAGCCGAGGT
TGGCTCTCCTAGGAGTGAGAACTGCCCAAGGGCTGCAGAAACAGGCCACCCAGCTCTATCTGGGGGCTCCATCGG 
TGGGTAGGGGGACAGTGGGGGCAGTTCTGGGCCCACCCAGCCACTGTTCCTGACCCCAAGTCCTGGTGACTTTCT 
GAGGTGCCCACTCCCATCCAACCTGCCTTGCTGGCCAGCCTTGTGGCTTTGCCCAGCTGTGTGTGTGAGGGTGGC 
ATGCCGACCTCCAGTCCAGCCCAGGGCGGTAGCAGCAAAGCGTGGC^TCGCCTCGGTTTCTTACAAAAATTCATA 
ATAATATTAATAATAATATACTCGACATTGTCGGGCTGG



SEQ ID NO:359

>GL2-86-5M13F Direction: sense 
CTGGTGGGGATGGGGACCGCCTGCCCGGGGGTGAGCTGCCT 
ATCCTGAGTGGGTGGCAGAGACTCCTGCGATGCCCGTCTCAGGTAGCTGTGGGGCACCAGCCCACAAGCCGAGGT
TGGCTCTCCTAGGAGTGAGAACTGCCCAAGGGCTGCAGAAACAGGCCACCCAGCTCTATCTGGGGGCTCCATCGG
TGGGTAGGGGGACAGTGGGGGCAGTTCTGGGCCCACCCAGCCACTGTNCCTGACCCCAAGTCCTGGTGACTTTCT 
GAGGTGCCCACTCCACATCCAACCTGCCTTGCTGGNCAGCCTTGNTGGCATAATGCCCAGNCTGATAGATGCGTG 
AGGGNGGACATGCCCACCTACCAGTACCAGACCCAGGGCGGATAGGCAGCAAAGNACGTGGCATCGCCTCGGG 

SEQ ID NO:360 

>GL2-86-1M13R Direction: N/A

CTGGTCAGCGGCCGGTCCCCAGGTCCCAACGTGGCAGTCAACGCCAAGGGCAGCGTCTACATCGGCGGAGCCCCT 
GACGTGGCCACGCTGACCGGGGGCAGATTCTCCTCGGGCATCACAGGCTGTGTCAAGAACCTGGTGCTGCACTCG 
GCCCGACCCGGCGCCCCGCCCCCACAGCCCCTGGACCTGCAGACCGGCCCAGGCCGGGGCCAACACACGCCCCTG
CCCCTCGTAGGCACCTGCCTGCCCCACACGGACTCCCGGGCCACGCCCCAGCCCGACAATGTCGAGTATATTATT 
ATTAATATTATTATGAA 

SEQ ID NO:361

>GL2-86-2M13R Direction: N/A

CTGGTCAGCGGCCGGTCCCCAGGTCCCAACGTGGCAGTCAACGCCAAGGGCAGCGTCTACATCGGCGGAGCCCCT 
GACGTGGCCACGCTGACCGGGGGCAGATTCTCCTCGGGCATCACAGGCTGTGTCAAGAACCTGGTGCTGCACTCG 
GCCCGACCCGGCGCCCCGCCCCCACAA 

SEQ ID NO:362

>GL2-86-3M13R Direction: N/A 

TTGGTCAGCGGCCGGTCCCCAGGTCCCAACGTGGCAGTCAACGCCAAGGGCAGCGTCTACATCGGCGGAGCCCCT 
GACGTGGCCACGCTGACCGGGGGCAGATTCTCCTCGGGCATCACAGGCTGTGTCAAGAACCTGGTGCTGCACTCG 
GCCCGACCCGGCGCCCCGCCCCCACAGCCCCTGGACCTGCAGCACCGCGCCCAGGCCGGGGCCAACACACGCCCC 
TGCCCCTCGTAGGCACCTGCCTGCCCCACACGGACTCCCGGGCCACGCCCCAGCCCGACAATGTCGAGTATATTA
TTATTAATATTATTATGAATTTTNNGTAAGAAACCGAGGCGATGCCACGCTTANGCTGCTACCGCCCTGGGCTGG 
ACTGGAGGTGGGCATGCCACC 

SEQ ID NO:363 

>GL2-88-5M13R Direction: anti-sense

CTGGTCAGCGGCCGGTCCCCAGGTCCCAACGTGGCAGTCAACGCCAAGGGCAGCGTCTACATCGGCGGAGCCCCT 
GACGTGGCCACGCTGACCGGGGGCAGATTCTCCTCGGGCATCACAGGCTGTGTCAAGAACCTGGTGCTGCACTCG 
GCCCGACCCGGCGCCCCGCCCCACAGCCCCTGGACCTGCAGCACCGCGCCCAGGCCGGGGCCAACACACGCCCCT 
GCCCCTCGTAGGCACCTGCCTGCCCCACACGGACTCCCGGGCCACGCCCCAGCCGACAATGTCGAGTATATTATT 

ATTAACTATTATTATGAACTTTTTGTAAGAAACCGAGGCGATGCCACGCTTTGCTGCTACCGGCCTGGGGCTGGA

CTGGAGGTGGGCATGCCAACACCATCACGGCACAACAGCTGGCAAAAGCCAACAAAGGGCCTTGGGCCAGGCAAG 

GGCACAGGTTGGGACTGGGAGTGGGGCACCCTCANGAAAGGTCACCCAGGGAACTATGGGGGGGTNCCAGAGGAA 

ACCACAGGTAGGGGACCTGGGGGTTTNGGGGGCCCCCCAGGAAAACTTGGGGCGCCCACAAANATNGNTACCACC 

ACCATAAACACACAAGCCCAGAANTGTGAAAACCCCCAACCCAGAGAAANTAGCAGAGCCATCGAGGAAAGCACC 
CCAAGATACACAGCANNACCCCATGAGAA

SEQ ID NO:364 

>GL2_93_2B09_G3F1 Direction: N/A

AAGCTGCCTTTTGCTCCACAGCCGGCACTAAAGACAATTCCCAATCCTGAGTGGGTGGCAGAGACTCCCTTTGGG 
GCGATGCCCGTCTTCCAGGTAAGGCTGTGGTGGGCGGGCCAACACACCAAAAAGGGGCCCCCCCCAAAAATAGCC
CCATACAAACAAAACACGNGGGGCCCATACCCCACGGGAAAAGGGGGGTTTATGGGTGGGCCATTTTCCCCCAGT 
TTCCACCCGTTTAACAGGCGNGGCAACACNGCTTTTTGGGCGTGNCTGGAACAACAACCCTTTTGGGTCTCCTCC 
NCCCAAAAAGTAGGGTGGGCTGGGCGGCCCCTTTTATGTGTGGCCCAANACNGGAGAACACAATACTAAATAATC 
CCAAACTGGTGTGGGGTGTCCCGCCCAAATANCACCCACCCCAANTAGATGCCCCTTTTACCCCGTTTTATAAAA 
AATTTTTTTTTCCCCCTATATCTTGGCGGGGGCGAGGGGATATGT

SEQ ID NO:365 

>gi|7427516|ref|NM_005529.2| Homo sapiens heparan sulfate proteoglycan 2 (perlecan) (HSPG2), mRNA

GCCCGGAGCGAGCGAGCGAGAGAGCGGCGCGGGCCGGGCCATGGGGTGGCGGGCGCCGGGCGCGCTGCTG
CTGGCGCTGCTGCTGCACGGGCGGCTGCTGGCGGTGACCCATGGGCTGAGGGCATACGATGGCTTGTCTC 
TGCCTGAGGACATAGAGACCGTCACAGCAAGCCAAATGCGCTGGACACATTCGTACCTTTCTGATGATGA 
GGACATGCTGGCTGACAGCATCTCAGGAGACGACCTGGGCAGTGGGGACCTGGGCAGCGGGGACTTCCAG 
ATGGTTTATTTCCGAGCCCTGGTGAATTTCACTCGCTCCATCGAGTACAGCCCTCAGCTGGAGGATGCAG 

GCTCCAGAGAGTTTCGAGAGGTGTCCGAGGCTGTGGTAGACACGCTGGAGTCGGAGTACTTGAAAATTCC
CGGAGACCAGGTTGTCAGTGTGGTGTTCATCAAGGAGCTGGATGGCTGGGTTTTTGTGGAGCTGGATGTG
GGCTCGGAAGGGAATGCGGATGGGGCTCAGATTCAGGAGATGCTGCTCAGGGTCATCTCCAGCGGCTCTG 
TGGCCTCCTACGTCACCTCTCCCCAGGGATTCCAGTTCCGACGCCTGGGCACAGTGCCCCAGTTCCCAAG 
AGCCTGCACGGAGGCCGAGTTTGCCTGCCACAGCTACAATGAGTGTGTGGCCCTGGAGTATCGCTGTGAC 
CGGCGGCCCGACTGCAGGGACATGTCTGATGAGCTCAATTGTGAGGAGCCAGTCCTGGGTATCAGCCCCA
CATTCTCTCTCCTCGTGGAGACGACATCTTTACCGCCCCGGCCAGAGACAACCATCATGCGACAGCCACC 
AGTCACCCACGCTCCTCAGCCCCTGCTTCCCGGTTCCGTCAGGCCCCTGCCCTGTGGGCCCCAGGAGGCC 
GCATGCCGCAATGGGCACTGCATCCCCAGAGACTACCTCTGCGACGGACAGGAGGACTGCGAGGACGGCA 
GCGATGAGCTAGACTGTGGCCCCCCGCCACCCTGTGAGCCCAACGAGTTCCCCTGCGGGAATGGACATTG 

TGCCCTCAAGCTGTGGCGCTGCGATGGTGACTTTGACTGTGAGGACCGAACTGATGAAGCCAACTGCCCC
ACCAAGCGTCCTGAGGAAGTGTGCGGGCCCACACAGTTCCGATGCGTCTCTACCAACATGTGCATCCCAG 
CCAGCTTCCACTGTGACGAGGAGAGCGACTGTCCTGACCGGAGCGACGAGTTTGGCTGCATGCCCCCCCA 
GGTGGTGACACCTCCCCGGGAGTCCATCCAGGCTTCCCGGGGCCAGACAGTGACCTTCACCTGCGTGGCC 
ATTGGCGTCCCCGCCCCCTTTCTCATCAATTGGAGGCTCAACTGGGGCCACATCCCCTCTCAGCCCAGGG 

TGACAGTGACCAGCGAGGGTGGCCGTGGCACACTGATCATCCGTGATGTGAAGGAGTCAGACCAGGGTGC
CTACACCTGTGAGGCCATGAACGCCCGGGGCATGGTGTTTGGCATTCCTGACGGTGTCCTTGAGCTCGTC 
CCACAACGAGCAGGCCCCTGCCCTGACGGCCACTTCTACCTGGAGCACAGCGCCGCCTGCCTGCCCTGCT 
TCTGCTTTGGCATCACCAGCGTGTGCCAGAGCACCCGCCGCTTCCGGGACCAGATCAGGCTGCGCTTTGA 
CCAACCCGATGACTTCAAGGGTGTGAATGTGACAATGCCTGCGCAGCCCGGCACGCCACCCCTCTCCTCC 

ACGCAGCTGCAGATCGACCCATCCCTGCACGAGTTCCAGCTAGTCGACCTGTCCCGCCGCTTCCTCGTCC
ACGACTCCTTCTGGGCTCTGCCTGAACAGTTCCTGGGCAACAAGGTGGACTCCTATGGCGGCTCCCTGCG 
TTACAACGTGCGCTACGAGTTGGCCCGTGGCATGCTGGAGCCAGTGCAGCGGCCGGACGTGGTCCTCGTG 
GGTGCCGGGTACCGCCTCCTCTCCCGAGGCCACACACCCACCCAACCTGGTGCTCTGAACCAGCGCCAGG 
TCCAGTTCTCTGAGGAGCACTGGGTCCATGAGTCTGGCCGGCCGGTGCAGCGCGCGGAGCTGCTGCAGGT 

GCTGCAGAGCCTGGAGGCCGTGCTCATCCAGACCGTGTACAACACCAAGATGGCTAGCGTGGGACTTAGC
GACATCGCCATGGATACCACCGTCACCCATGCCACCAGCCATGGCCGTGCCCACAGTGTGGAGGAGTGCA 
GATGCCCCATTGGCTATTCTGGCTTGTCCTGCGAGAGCTGTGATGCCCACTTCACTCGGGTGCCTGGTGG 
GCCCTACCTGGGCACCTGCTCTGGTTGCAGTTGCAATGGCCATGCCAGCTCCTGTGACCCTGTGTATGGC 
CACTGCCTGAATTGCCAGCACAACACGGAGGGGCCACAGTGCAAGAAGTGCAAGGCTGGCTTCTTTGGGG 

ACGCCATGAAGGCCACGGCCACTTCCTGCCGGCCCTGCCCTTGCCCATACATCGATGCCTCCCGCAGATT
CTCAGACACTTGCTTCCTGGACACGGATGGCCAAGCCACATGTGACGCCTGTGCCCCAGGCTACACTGGC 
CGCCGCTGTGAGAGCTGTGCCCCCGGATACGAGGGCAACCCCATCCAGCCCGGCGGGAAGTGCAGGCCCG 
TCAACCAGGAGATTGTGCGCTGTGACGAGCGTGGCAGCATGGGGACCTCCGGGGAGGCCTGCCGCTGTAA 
GAACAATGTGGTGGGGCGCTTGTGCAATGAATGTGCTGACCGCTCTTTCCACCTGAGTACCCGAAACCCC 

GATGGCTGCCTCAAGTGCTTCTGCATGGGTGTCAGTCGCCACTGCACCAGCTCTTCATGGAGCCGTGCCC
AGTTGCATGGGGCCTCTGAGGAGCCTGGTCACTTCAGCCTGACCAACGCCGCAAGCACCCACACCACCAA 
CGAGGGCATCTTCTCCCCCACGCCCGGGGAACTGGGATTCTCCTCCTTCCACAGACTCTTATCTGGACCC 
TACTTCTGGAGCCTCCCTTCACGCTTCCTGGGGGACAAGGTGACCTCCTATGGAGGAGAGCTGCGCTTCA 
CAGTGACCCAGAGGTCCCAGCCGGGCTCCACACCCCTGCACGGGCAGCCGTTGGTGGTGCTGCAAGGTAA 

CAACATCATCCTAGAGCACCATGTGGCCCAGGAGCCCAGCCCCGGCCAGCCCAGCACCTTCATTGTGCCT
TTCCGGGAGCAAGCATGGCAGCGGCCCGATGGGCAGCCAGCCACACGGGAGCACCTGCTGATGGCACTGG 
CAGGCATCGACACCCTCCTGATCCGAGCATCCTACGCCCAGCAGCCCGCTGAGAGCAGGGTCTCTGGCAT 
CAGCATGGACGTGGCTGTGCCCGAGGAAACCGGCCAGGACCCCGCGCTGGAAGTGGAACAGTGCTCCTGC 
CCACCCGGGTACCGTGGGCCGTCCTGCCAGGACTGTGACACAGGCTACACACGCACGCCCAGTGGCCTCT 

ACCTGGGTACCTGTGAACGCTGCAGCTGCCATGGCCACTCAGAGGCCTGCGAGCCAGAAACAGGTGCCTG
CCAGGGCTGCCAGCATCACACGGAGGGCCCTCGGTGTGAGCAGTGCCAGCCAGGATACTACGGGGACGCC 
CAGCGGGGGACACCACAGGACTGCCAGCTGTGCCCCTGCTACGGAGACCCTGCTGCCGGCCAGGCTGCCC 
ACACTTGTTTTCTGGACACAGACGGCCACCCCACCTGTGATGCGTGCTCCCCAGGCCACAGTGGGCGTCA 
CTGTGAGAGGTGCGCCCCTGGCTACTATGGCAACCCCAGCCAGGGCCAGCCATGCCAGAGAGACAGCCAG 

GTGCCAGGGCCCATAGGCTGCAACTGTGACCCCCAAGGCAGCGTCAGCAGCCAGTGTGATGCTGCTGGTC
AGTGCCAGTGCAAGGCCCAGGTGGAAGGCCTCACTTGCAGCCACTGCCGGCCCCACCACTTCCACCTGAG 
TGCCAGCAACCCAGACGGCTGCCTGCCCTGCTTCTGTATGGGCATCACCCAGCAGTGCGCCAGCTCTGCC 
TACACACGCCACCTGATCTCCACCCACTTTGCCCCTGGGGACTTCCAAGGCTTTGCCCTGGTGAACCCAC 
AGCGAAACAGCCGCCTGACAGGAGAATTCACTGTGGAACCCGTGCCCGAGGGTGCCCAGCTCTCTTTTGG 

CAACTTTGCCCAACTCGGCCATGAGTCCTTCTACTGGCAGCTGCCGGAGACATACCAGGGAGACAAGGTG
GCGGCCTACGGTGGGAAGTTGCGATACACCCTCTCCTACACAGCAGGCCCACAGGGCAGCCCACTCTCTG 
ACCCCGATGTGCAGATCACGGGCAACAACATCATGCTAGTGGCCTCCCAGCCAGCGCTGCAGGGCCCTGA 
GAGGAGGAGCTACGAGATCATGTTCCGAGAGGAATTCTGGCGCCGGCCCGATGGGCAGCCGGCCACACGC 
GAGCACCTCCTGATGGCACTGGCCGACCTGGATGAGCTCCTGATCCGGGCCACGTTCTCCTCCGTGCCGC 

TGGTGGCCAGCATCAGCGCAGTCAGCCTGGAGGTCGCCCAGCCGGGGCCCTCAAACAGACCCCGCGCCCT
CGAGGTGGAGGAGTGCCGCTGCCCGCCAGGCTACATCGGTCTGTCCTGCCAGGACTGTGCCCCCGGCTAC 
ACGCGCACCGGGAGTGGGCTCTACCTCGGCCACTGCGAGCTATGTGAATGCAATGGCCACTCAGACCTGT

GCCACCCAGAGACTGGGGCCTGCTCGCAATGCCAGCACAACGCCGCAGGGGAGTTCTGCGAGCTTTGTGC 

CCCTGGCTACTACGGAGATGCCACAGCCGGGACGCCTGAGGACTGCCAGCCCTGTGCCTGCCCACTGACC 

AACCCAGAGAACATGTTTTCCCGCACCTGTGAGAGCCTGGGAGCCGGCGGGTACCGCTGCACGGCCTGCG 

AACCCGGCTACACTGGCCAGTACTGTGAGCAGTGTGGCCCAGGTTACGTGGGTAACCCCAGTGTGCAAGG 

GGGCCAGTGCCTGCCAGAGACAAACCAAGCCCCACTGGTGGTCGAGGTCCATCCTGCTCGAAGCATAGTG 

CCCCAAGGTGGCTCCCACTCCCTGCGGTGTCAGGTCAGTGGGAGGGGACCCCACTACTTCTATTGGTCCC 

GTGAGGATGGGCGGCCTGTGCCCAGCGGCACCCAGCAGCGACATCAAGGCTCCGAGCTCCACTTCCCCAG 

CGTCCAGCCCTCGGATGCTGGGGTCTACATTTGCACCTGCCGTAATCTCCACCGATCCAATACCAGCCGG 

GCAGAGCTGCTGGTCACTGAGGCTCCAAGCAAGCCCATCACAGTGACTGTGGAGGAGCAGCGGAGCCAGA 

GCGTGCGCCCCGGAGCTGACGTCACCTTCATCTGCACAGCCAAAAGCAAGTCCCCAGCCTATACCCTGGT 

GTGGACCCGCCTGCACAACGGGAAACTGCCCACCCGAGCCATGGATTTCAATGGCATCCTGACCATTCGC 

AACGTCCAGCTGAGTGATGCAGGCACCTACGTGTGCACCGGCTCCAACATGTTTGCCATGGACCAGGGCA 

CAGCCACTCTACATGTGCAGGCCTCGGGCACCTTGTCCGCCCCCGTGGTCTCCATCCATCCGCCACAGCT 

CACAGTGCAGCCCGGGCAACTGGCGGAGTTCCGCTGCAGCGCCACAGGGAGCCCCACGCCCACCCTCGAG 

TGGACAGGGGGCCCCGGCGGCCAGCTCCCTGCGAAGGCACAAATCCACGGCGGCATCCTGCGCCTGCCAG 

CTGTCGAGCCCACGGATCAGGCCCAGTACTTGTGCCGAGCCCACAGCAGCGCTGGGCAGCAGGTGGCCAG 

GGCTGTGCTCCACGTGCATGGGGGCGGTGGGCCCAGAGTCCAAGTGAGCCCAGAGAGGACCCAGGTCCAC 

GCAGGCCGCACCGTCAGGCTGTACTGCAGGGCTGCAGGCGTGCCTAGCGCCACCATCACCTGGAGGAAGG 

AAGGGGGCAGCCTCCCACCACAGGCCCGGTCAGAGCGCACAGACATCGCGACACTGCTCATCCCAGCCAT 

CACGACTGCTGACGCCGGCTTCTACCTCTGCGTGGCCACCAGCCCTGCAGGCACTGCCCAGGCCCGGATC 

CAAGTGGTTGTCCTTTCAGCCTCAGATGCCAGCCAACCGCCGGTCAAGATTGAGTCCTCATCGCCTTCTG 

TGACAGAAGGGCAAACACTCGACCTCAACTGTGTGGTGGCAGGGTCAGCCCATGCCCAGGTCACCTGGTA 

CAGGCGAGGGGGTAGCCTGCCTCACCACACCCAGGTGCACGGCTCCCGTCTGCGGCTCCCCCAGGTCTCA 

CCAGCTGATTCTGGAGAGTATGTGTGCCGTGTGGAGAATGGATCGGGCCCCAAGGAGGCCTCCATTACTG 

TGTCTGTGCTCCACGGCACCCATTCTGGCCCCAGCTACACCCCAGTGCCCGGCAGCACCCGGCCCATCCG 

CATCGAGCCCTCCTCCTCACACGTGGCGGAAGGGCAGACCCTGGATCTGAACTGCGTGGTGCCCGGGCAG 

GCCCACGCCCAGGTCACGTGGCACAAGCGTGGGGGCAGCCTCCCTGCCCGGCACCAGACCCACGGCTCGC 

TGCTGCGGCTGCACCAGGTGACCCCGGCCGACTCAGGCGAGTATGTGTGCCATGTGGTGGGCACCTCCGG 

CCCCCTAGAGGCCTCAGTCCTGGTCACCATCGAAGCCTCTGTCATCCCTGGACCCATCCCACCTGTCAGG 

ATCGAGTCTTCATCCTCCACAGTGGCCGAGGGCCAGACCCTGGATCTGAGCTGCGTGGTGGCAGGGCAGG 

CCCACGCCCAGGTCACATGGTACAAGCGTGGGGGCAGCCTCCCTGCCCGGCACCAGGTTCGTGGCTCCCG 

CCTGTACATCTTCCAGGCCTCACCTGCCGATGCGGGACAGTACGTCTGCCGGGCCAGCAACGGCATGGAG 

GCCTCCATCACGGTCACAGTAACTGGGACCCAGGGGGCCAACTTAGCCTACCCTGCCGGCAGCACCCAGC 

CCATCCGCATCGAGCCCTCCTCCTCGCAAGTGGCGGAAGGGCAGACCCTGGATCTGAACTGCGTGGTGCC 

CGGGCAGTCCCATGCCCAGGTCACGTGGCACAAGCGTGGGGGCAGCCTCCCTGTCCGGCACCAGACCCAC 

GGCTCCCTGCTGAGACTCTACCAAGCGTCCCCCGCCGACTCGGGCGAGTACGTGTGCCGAGTGTTGGGCA 

GCTCCGTGCCTCTAGAGGCCTCTGTCCTGGTCACCATTGAGCCTGCGGGCTCAGTGCCTGCACTTGGGGT 

CACCCCCACGGTCCGGATCGAGTCATCGTCTTCGCAAGTGGCAGAGGGGCAGACCCTGGACCTGAACTGC 

CTCGTTGCTGGTCAGGCCCATGCCCAGGTCACGTGGCACAAGCGCGGGGGCAGCCTCCCGGCCCGGCACC 

AGGTGCATGGCTCGAGGCTACGCCTGCTCCAGGTGACCCCAGCTGATTCAGGGGAGTACGTGTGCCGTGT 

GGTCGGCAGCTCAGGTACCCAGGAAGCCTCAGTCCTTGTCACCATCCAGCAGCGCCTTAGTGGCTCCCAC 

TCCCAGGGTGTGGCGTACCCCGTCCGCATCGAGTCCTCCTCAGCCTCCCTGGCCAATGGACACACCCTGG 

ACCTCAACTGCCTGGTTGCCAGCCAGGCTCCCCACACCATCACCTGGTATAAGCGTGGAGGCAGCTTACC 

CAGCCGGCACCAGATCGTGGGCTCCCGGCTGCGGATCCCTCAGGTGACTCCGGCAGACTCGGGCGAGTAC 

GTGTGTCACGTCAGTAACGGTGCAGGCTCCCGGGAGACCTCGCTCATCGTCACCATCCAGGGCAGCGGTT 

CCTCCCACGTGCCCAGAGTCTCCCCACCGATCAGGATCGAGTCGTCTTCCCCCACGGTGGTGGAAGGGCA 

GACCTTGGATCTGAACTGCGTGGTCGCCAGGCAGCCCCAGGCTATCATCACATGGTACAAGCGTGGGGGC 

AGCCTTCCCTCCCGACACCAGACCCATGGCTCCCACCTGCGGTTGCACCAAATGTCTGTGGCTGACTCGG 

GCGAGTATGTGTGCCGGGCCAACAACAACATCGATGCCCTGGAGGCCTCAATCGTCATCTCCGTCTCCCC 

TAGCGCCGGCAGCCCCTCCGCCCCTGGCAGCTCCATGCCCATCAGAATTGAGTCATCCTCCTCACACGTG 

GCCGAAGGGGAGACCCTGGATCTGAACTGCGTGGTCCCCGGGCAGGCCCATGCCCAGGTCACTTGGCACA 

AGCGTGGGGGCAGCCTCCCCAGTTACCATCAGACCCGCGGCTCACGGCTGCGGCTGCACCATGTGTCCCC 

GGCCGACTCGGGTGAATACGTGTGCCGGGTGATGGGCAGCTCTGGCCCCCTGGAGGCCTCAGTCCTCGTC 

ACCATCGAAGCCTCTGGCTCAAGTGCTGTCCACGTCCCCGCCCCAGGTGGAGCCCCACCCATCCGCATCG 

AGCCCTCCTCCTCCCGAGTGGCAGAAGGGCAGACCCTGGATCTGAAGTGCGTGGTGCCCGGGCAGGCCCA 

CGCCCAGGTCACGTGGCACAAGCGTGGAGGAAACCTCCCTGCCCGGCACCAGGTCCACGGCCCACTGCTG 

AGGCTGAACCAGGTGTCCCCGGCTGACTCTGGCGAGTACTCGTGCCAAGTGACCGGAAGCTCAGGCACCC 

TGGAGGCATCTGTCCTGGTCACAATTGAGCCCTCCAGCCCAGGACCCATTCCTGCTCCAGGACTGGCCCA 

GCCCATCTACATCGAGGCCTCCTCTTCACACGTGACTGAAGGGCAGACTCTGGATCTGAACTGTGTGGTG 

CCCGGGCAGGCCCATGCCCAGGTCACGTGGTACAAGCGCGGGGGCAGCCTCCCCGCCCGGCACCAGACCC 




ATGGCTCCCAGCTGCGGCTCCACCACGTCTCCCCTGCCGACTCAGGCGAGTATGTGTGTCGTGCAGCCGG 
CGGCCCAGGCCCTGAGCAAGAAGCCTCCTTCACAGTCACCGTCCCGCCCAGTGAGGGGTCTTCCTACCGC 
CTTAGGAGCCCGGTCATCTCGATCGACCCGCCCAGCAGCACCGTGCAGCAGGGCCAGGATGCCAGCTTCA 
AGTGCCTCATCCATGACGGGGCAGCCCCCATCAGCCTCGAGTGGAAGACCCGGAACCAGGAGCTGGAGGA 
CAACGTCCACATCAGTCCCAATGGCTCCATCATCACCATCGTGGGCACCCGGCCCAGCAACCACGGTACC
TACCGCTGCGTGGCCTCCAATGCCTACGGTGTGGCCCAGAGTGTGGTGAACCTCAGTGTGCACGGGCCCC 
CTACAGTGTCCGTGCTCCCCGAGGGCCCCGTGTGGGTGAAAGTGGGAAAGGCCGTCACCCTGGAGTGTGT 
CAGTGCCGGGGAGCCCCGCTCCTCTGCTCGTTGGACCCGGATCAGCAGCACCCCTGCCAAGTTGGAGCAG 
CGGACATATGGGCTCATGGACAGCCACACGGTGCTGCAGATTTCATCAGCTAAACCATCAGATGCGGGCA 
CTTATGTGTGCCTTGCTCAGAATGCACTAGGCACAGCACAGAAGCAGGTGGAGGTGATCGTGGACACGGG
CGCCATGGCCCCAGGGGCCCCTCAGGTCCAAGCTGAAGAAGCTGAGCTGACTGTGGAGGCTGGACACACG 
GCCACCTTGCGCTGCTCAGCCACAGGCAGCCCCGCGCGCACCATCCACTGGTCCAAGCTGCGTTCCCCAC 
TGCCCTGGCAGCACCGGCTGGAAGGTGACACACTCATCATACCCCGGGTAGCCCAGCAGGACTCGGGCCA 
GTACATCTGCAATGCCACTAGCCCTGCCGGGCACGCTGAGGCCACCATCATCCTGCACGTGGAGAGCCCA 
CCATATGCCACCACGGTCCCAGAGCACGCTTCGGTGCAGGCAGGGGAGACGGTGCAGCTCCAGTGCCTGG
CTCACGGGACACCCCCACTCACCTTCCAGTGGAGCCGCGTGGGCAGCAGCCTTCCTGGGAGGGCGACCGC 
CAGGAACGAGCTGCTGCACTTTGAGCGTGCAGCCCCTGAGGACTCAGGCCGCTACCGCTGCCGGGTCACC 
AACAAGGTGGGCTCAGCCGAGGCCTTTGCCCAGCTGCTCGTCCAAGGCCCTCCCGGCTCTCTCCCTGCCA 
CCTCCATCCCAGCAGGGTCCACGCCCACCGTGCAGGTCACGCCTCAGCTAGAGACCAAGAGCATTGGGGC 
CAGCGTTGAGTTCCATTGTGCTGTGCCCAGCGACCGGGGTACCCAGCTCCGTTGGTTCAAGGAAGGGGGT
CAGCTGCCTCCCGGTCACAGCGTGCAGGATGGGGTGCTCCGAATCCAGAACTTGGACCAGAGCTGCCAAG 
GGACGTATATATGCCAGGCCCATGGACCTTGGGGGAAGGCCCAGGCCAGTGCCCAGCTGGTTATCCAAGC 
CCTGCCCTCGGTGCTCATCAACATCCGGACCTCTGTGCAGACCGTGGTGGTTGGCCACGCCGTGGAGTTC 
GAATGCCTGGCACTGGGTGACCCCAAGCCTCAGGTGACATGGAGCAAAGTTGGAGGGCACCTGCGGCCAG 
GCATTGTGCAGAGCGGAGGTGTCGTCAGGATCGCCCACGTAGAGCTGGCTGATGCGGGACAGTATCGCTG
CACTGCCACCAACGCAGCTGGCACCACACAATCCCACGTCCTGCTGCTTGTGCAAGCCTTGCCCCAGATC 
TCAATGCCCCAAGAAGTCCGTGTGCCTGCTGGTTCTGCAGCTGTCTTCCCCTGCATAGCCTCAGGCTACC 
CCACTCCTGACATCAGCTGGAGCAAGCTGGATGGCAGCCTGCCACCTGACAGCCGCCTGGAGAACAACAT 
GCTGATGCTGCCCTCAGTCCAACCCCAGGACGCAGGTACCTACGTCTGCACCGCCACTAACCGCCAGGGC 
AAGGTCAAAGCCTTTGCCCACCTGCAGGTGCCAGAGCGGGTGGTGCCCTACTTCACGCAGACCCCCTACT
CCTTCCTACCGCTGCCCACCATCAAGGATGCCTACAGGAAGTTCGAGATCAAGATCACCTTCCGGCCTGA 
CTCAGCCGATGGGATGCTGCTGTACAATGGGCAGAAGCGAGTCCCAGGGAGCCCCACCAACCTGGCCAAC 
CGGCAGCCCGACTTCATCTCCTTCGGCCTCGTGGGGGGAAGGCCCGAGTTCCGGTTCGATGCAGGCTCAG 
GCATGGCGACCATCCGCCATCCCACACCACTGGCCCTGGGCCATTTCCACACCGTGACCCTGCTGCGCAG 
CCTCACCCAGGGCTCCCTGATTGTGGGTGACCTGGCCCCGGTCAACGGGACCTCCGAGGGCAAGTTCCAG
GGCCTGGATCTGAACGAGGAACTCTACCTGGGTGGCTATCCTGACTATGGTGCCATCCCCAAGGCGGGGC 
TGAGCAGCGGCTTCATAGGCTGTGTCCGGGAGCTGCGCATCCAGGGCGAGGAGATCGTCTTCCATGACCT 
CAACCTCACGGCGCACGGCATCTCCCACTGCCCCACCTGTCGGGACCGGCCCTGCCAGAATGGCGGTCAG 
TGCCATGACTCTGAGAGCAGCAGCTATGTGTGCGTCTGCCCAGCTGGCTTCACCGGGAGCCGCTGTGAGC 
ACTCGCAGGCCCTGCACTGCCATCCAGAGGCCTGTGGGCCCGACGCCACCTGTGTGAACCGGCCTGACGG
TCGAGGCTACACCTGCCGCTGCCACCTGGGCCGCTCGGGGTTGCGGTGTGAGGAAGGTGTGACAGTGACC 
ACCCCCTCGCTGTCGGGTGCTGGCTCCTACCTGGCACTGCCCGCCCTCACCAACACACACCACGAGCTAC 
GCCTGGACGTGGAGTTCAAGCCACTCGCCCCTGACGGGGTCCTGCTGTTCAGCGGGGGGAAGAGCGGGCC 
TGTGGAGGACTTCGTGTCCCTGGCGATGGTGGGCGGCCACCTGGAGTTCCGCTATGAGTTGGGGTCAGGG 
CTGGCCGTTCTGCGGACGGCCGAGCCGCTGGCCCTGGGCCGCTGGCACCGTGTGTCTGCAGAGCGTCTCA
ACAAGGACGGCAGCCTGCGGGTGAATGGTGGACGCCCTGTGCTGCGCTCCTCGCCCGGCAAGAGCCAGGG 
CCTCAACCTGCACACCCTGCTCTACCTGGGGGGTGTGGAGCCTTCCGTGCCACTGTCCCCGGCCACCAAC 
ATGAGCGCTCACTTCCGCGGCTGTGTGGGCGAGGTGTCAGTGAATGGCAAACGGCTGGACCTCACCTACA 
GTTTCCTAGGCAGCCAGGGCATCGGGCAATGCTATGACAGCTCCCCATGTGAGCGCCAGCCTTGCCAACA 
TGGTGCCACGTGCATGCCCGCTGGCGAGTATGAGTTCCAGTGCCTGTGTCGAGATGGAATCAAAGGAGAC
CTGTGTGAGCACGAGGAGAACCCCTGCCAGCTCCGTGAACCCTGTCTGCATGGGGGCACCTGCCAGGGCA 
CCCGCTGCCTCTGCCTCCCTGGCTTCTCTGGCCCACGCTGCCAACAAGGCTCTGGACATGGCATAGCAGA 
GTCCGACTGGCATCTTGAAGGCAGCGGGGGCAATGATGCCCCTGGGCAGTACGGAGCCTATTTCCACGAT 
GATGGCTTCCTCGCCTTCCCTGGCCATGTCTTCTCCAGGAGCCTGCCCGAGGTGCCCGAGACCATCGAGC 
TGGAGGTTCGGACCAGCACAGCCAGTGGCCTCCTGCTCTGGCAGGGTGTGGAGGTGGGAGAGGCCGGCCA
AGGCAAGGACTTCATCAGCCTCGGGCTTCAAGACGGGCACCTTGTCTTCAGGTACCAGCTGGGTAGTGGG 
GAGGCCCGCCTGGTCTCTGAGGACCCCATCAATGACGGCGAGTGGCACCGGGTGACAGCACTGCGGGAGG 
GCCGCAGAGGTTCCATCCAAGTCGACGGTGAGGAGCTGGTCAGCGGCCGGTCCCCAGGTCCCAACGTGGC 
AGTGAACGCCAAGGGCAGCATCTACATCGGCGGAGCCCCTGACGTGGCCACGCTGACCGGGGGCAGATT 
TCCTCGGGCATCACAGGCTGTGTCAAGAACCTGGTGCTGCACTCGGCCCGACCCGGCGCCCCGCCCCCAC
AGCCCCTGGACCTGCAGCACCGCGCCCAGGCCGGGGCCAACACACGCCCCTGCCCCTCGTAGGCACCTGC 
CTGCCCCACACGGACTCCCGGGCCACGCCCCAGCCCGACAATGTCGAGTATATTATTATTAATATTATTA
TGAATTTTTGTAAGAAACCGAGCGATGCCACGCTTTGCTCGTACCGCCCTGGGCTGGACTGGAGGTGGGC 
ATGCCACCCTCACACACACAGCTGGGCAAAGCCACAAGGCTGGCCAGCAAGGCAGGTTGGATGGGAGTGG 
GCACCTCAGAAAGTCACCAGGACTTGGGGTCAGGAACAGTGGCTGGGTGGGCCCAGAACTGCCCCCAGTG 
TCCCCCTACCCACCGATGGAGCCCCCAGATAGAGCTGGGTGGCCTGTTTCTGCAGCCCTTGGGCAGTTCT 
CACTCCTAGGAGAGCCAACCTCGGCTTGTGGGCTGGTGCCCCACAGCTACCTGAGACGGGCATCGCAGGA 
GTCTCTGCCACCCACTCAGGATTGGGAATTGTCTTTAGTGCCGGCTGTGGAGCAAAAGGCAGCTCACCCC 
TGGGCAGGCGGTCCCCATCCCCACCAGCTCGTTTTTCAGCACCCCCACCCACCTCCACCCAGCCCCTGGC 



SEQ ID NO:366 

>gi|7427517|ref|NP_005520.2| heparan sulfate proteoglycan 2 (perlecan);

Heparan sulfate proteoglycan of basement membrane (perlecan) [Homo sapiens] 

MGWRAPGALLLALLLHGRLLAVTHGLRAYDGLSLPEDIETVTASQMRWTHSYLSDDEDMLADSISGDDLG

SGDLGSGDFQJWYFRALVNFTRSIEYSPQLEDAGSREFREVSEAVVDTLESEYLKIPGDQVVSVVFIKEL 

DGWVFVELDVGSEGNADGAQIQEMLLRVISSGSVASYVTSPQGFQFRRLGTVPQFPRACTEAEFACHSYN 

ECVALEYRCDRRPDCRDMSDELNCEEPVLGISPTFSLLVETTSLPPRPETTIMRQPPVTHAPQPLLPGSV 

RPLPCGPQEAACRNGHCIPRDYLCDGQEDCEDGSDELDCGPPPPCEPNEFPCGNGHCALKLWRCDGDFDC 

EDRTDEANCPTKRPEEVCGPTQFRCVSTWMCIPASFHCDEESDCPDRSDEFGCMPPQWTPPRESIQASR 
GQTVTFTCVAIGVPAPFIilX^RLNTCGHIPSQ 

GIPDGVLELVPQRAGPCPDGHFYIiEHSAACLPCFCFGITSVCQSTRRFRDQIRLRFDQPDDFKGVNVTMP 
AQPGTPPLSSTQLQIDPSLHEFQLVDLSRRFLVHDSFWALPEQFLGNKVDSYGGSLRYNVRYELARGMLE 
PVQRPDVVLVGAGYRLLSRGHTPTQPGALNQRQVQFSEEHWVHESGRPVQRAELLQVLQSLEAVLIQTVY 
NTKMAS VGL SDIAMDTTVTHATSHGRAHS VEE CR CP I G YS GliSCES CDAHFTR VPGGP YIjGTCS GCS CNG 
HASSCDPVYGHCLNCQHNTEGPQCKKCKAGFFGDAMKATATSCRPCPCPYIDASRRFSDTCFLDTDGQAT 
CDACAPGYTGRRCESCAPGYEGNPIQPGGKCRPVNQEIVRCDERGSMGTSGEACRCKNNWGRLCNECAD 
RSFHLSTRKPDGCLKCFCMGVSRHCTSSSWSRAQLHGASEEPGHFSLTNAASTHTTNEG1FSPTPGELGF 
SSFHRLLSGPYFWSLPSRFLGDK^SYGGELRFTVTQRSQPGSTPLHGQPLVVLQGNMIILEHHVAQEPS 
PGQPSTFIVPFREQAWQRPDGQPATREHLLMALAGIDTLLIRASYAQQPAESRVSGISMDVAVPEETGQD 
PALEVEQCSCPPGYRGPSCQDCDTGYTRTPSGLYLGTCERCSCHGHSEACEPETGACQGCQHHTEGPRCE 
QCQPGYYGDAQRGTPQDCQLCPCYGDPAAGQAAHTCFLDTDGHPTCDACSPGHSGRHCERCAPGYYGNPS 
QGQPCQRDSQVPGPIGCNCDPQGSVSSQCDAAGQCQCKAQVEGLTCSHCRPHHFHLSASNPDGCLPCFCM 
GITQQCASSAYTRHLISTHFAPGDFQGFALVNPQRNSRLTGEFTVEPVPEGAQLSFGNFAQLGHESFYWQ 
LPETYQGDKVAAYGGKLRYTLSYTAGPQGSPLSDPDVQITGNNIMLVASQPALQGPERRSYEIMFREEFW 
RRPDGQPATREHLLMALADLiDELLIRATFSSVPLVASISAVSLEVAQPGPSNRPRALEVEECRCPPGYIG 
LSCQDCAPGYTRTGSGLYLGHCELCECNGHSDLCHPETGACSQCQHNAAGEFCELCAPGYYGDATAGTPE 
DCQPCACPLTNPENMFSRTCESLGAGGYRCTACEPGYTGQYCEQCGPGYVGNPSVQGGQCLPETNQAPLV 
VEVHPARSIVPQGGSHSLRCQVSGRGPHYFYWSREDGRPVPSGTQQRHQGSELHFPSVQPSDAGVYICTC 
RNLHRSNTSRAEIiL VTEAPS KPI TVTVEEQRS QS VRPGADVTF I CTAKS KS PAYTLVWTRUJNGKLPTRA 
MDFNGILTIRWQLSDAGTYVCTGSmFAIVD^ 

ATGSPTPTLEWTGGPGGQLPAKAQIHGGILRLPAVEPTDQAQYLCRAHSSAGQQVARAVLHVHGGGGPRV 
QVS PERTQVHAGRTVRLYCRAAGVPS ATITWRKEGGSI4PPQARSERTD I ATLLIPAITTADAGF YLCVAT 
S PAGTAQARIQVWL S AS DAS Q P P VKIES S S PS VTEGQTLDLNCVVAGSAHAQVTWYRRGGSLPHHTQVH 

GSRLRLPQVSPADSGEYVCRVENGSGPKEAS ITVSYLHGTHSGPS YTPVPGSTRPIRIEPSSSHVAEGQT 
LDLNCWPGQAHAQVTI^KRGGSLPARHQ 

VI PGP I PPVRI E S S S S TVAEGQTLDL S C WAGQAHAQVTW YKRGGS IjPARHQVRGSRLYI FQAS PADAGQ 

WCRASNGMEASITVTVTGTQGANIAYPAGSTQPIRIEPSSSQVAEGQTLDLNCVVPGQSHAQVTWHKRG 

GSLPVRHQTHGSLLRLYQASPADSGEYVCRVLGSSVPLEASVLVTIEPAGSVPALGVTPTVRIESSSSQV 

AEGQTLDLNCLVAGQAHAQVTWHKRGGSLPARHQVHGSRLRLLQVTPADSGEYVCRWGSSGTQEASVLV 

TIQQRLSGSHSQGVAYPVRIESSSASLANGHTLDLNCLVASQAPHTITWYKRGGSLPSRHQIVGSRLRIP 

QVTPADSGEYVCHVSNGAGSRETSLIVTIQGSGSSHVPRVSPPIRIESSSPTWEGQTLDLNCWARQPQ 

AIITWKRGGSLPSRHQTHGSHLRLHQMSVADSGEW^ 

IRIESSSSHVAEGETLDLNCWPGQAHAQVTWHra^ 

SGPIiEASVLVTIEASGSSAVHVPAPGGAPPIRIEPSSSRVAEGQTLDLKCWPGQAHAQVTWHKRGGNLP 
ARHQVHGPLLRL^QVSPADSGEYSCQVTGSSGTLEASVLVTIEPSSPGPIPAPGLAQPIYIEASSSHVTE ' 
GQTLDLNCVVPGQAHAQVTWYKRGGS LPARHQTHGSQLRLHHVS PA3DSGE YVCRAAGGPGPEQEAS FTVT 

VPPSEGSSYRLRSPVISIDPPSSTVQQGQDASFKCLIHDGAAPISLEWKTRNQELEDNVHISPNGSIITI 
VGTRPSNHGT YRCVASNAYGVAQS WNL S VHGP PTVS VLP EGP VWVKVGKAVTIjECVS AGEPRS SARWTR 
I S S TPAKI/EQRTYGLMDSHTVLQI S S AKPSDAGTYVCLAQNAIjGTAQKQVE VI VDTGAMAPGAPQVQAEE 
AELTVEAGHTATLRCS ATGS PARTIHWS KLRS PIjPWQHRLEGDTTjI I PRVAQQDSGQYI CNATS PAGHAE 
ATIILHVESPPYATWPEHASVQAGETVQLQCL^^^ 
DSGRYRCRVTNKVGSAEAFAQLLVQGPPGSLPATSIPAGSTPTVQVTPQLETKSIGASVEFHCAVPSDRG
TQLRWFKEGGQLPPGHSVQDGVTJRIQOTjDQSCQGTYICQAHGPWGKAQASAQLVIQALPSVIiINIRTSVQ 
TWVGHAVEFECL ALGDPKPQVTWS KVGGHLRPGI VQS GGWR I AHVELADAGQYRCTATNAAGTTQSHV 
LLLVQALPQISMPQEVl^VPAGSAAVFPCIASGYPTPDISWSKLDGSL^ 
YVCTATNRQGKVKAFAHLQVPERVVPYFTQTPYSFLPLPTIKDAYRKFEIKITFRPDSADGMLLYNGQKR
VPGSPTNLANRQPDFISFGLVGGRPEFRFDAGSGMATIRHPTPLALGHFHTVTLLRSLTQGSLIVGDLAP
WGTSQGKFQGLDLNEELYLGGYPDYGAIPKAGLSSGFIGCVRELRIQ^ 

RDRPCQWGGQCHDSESSSYVCVCPAGFTGSRCEHSQAIiHCHPEACGPDATCVl^PDGRGYTCRaiLGRSG 
LRCEEGVT VTTPSLS GAGS YLABPALTNTHHELRLD VE FKPLAPDGVLLFS GGKS GP VEDFVS L AMVGGH 

LEFRYELGSGLAVLRTAEPLALGRWHRVSAERLNKDGSLRVNGGRPVLRSSPGKSQGLNLHTLLYLGGVE

PSVPLSPATNMSAHFRGCVGEVSVNGKRLDLTYSFLGSQGIGQCYDSSPCERQPCQHGATCMPAGEYEFQ
CLCRDGIKGDLCEHEENPCQLREPCLHGGTCQGTRCLCLPGFSGPRCQQGSGHGIAESDWHLEGSGGNDA 
PGQYGAYFHDDGFLAFPGHVFSRSLPEVPETIELEVRTSTASGLLLWQGVEVGEAGQGKDFISLGLQDGH
LVFRYQLGSGEARLVSEDPINDGEWHRVTALREGRRGSIQVDGEELVSGRSPGPNVAVNAKGSIYIGGAP

DVATLTGGRFSSGITGCVKNLVLHSARPGAPPPQPLDLQHRAQAGANTRPCPS

SEQ ID NO:367 

PGR-1 (T-cell activation protein)
>GL3_8D_3_M13F Direction: N/A 

ATATGCTCATCCAGATCAAAACGCAGGTGGAGGCCTCGGAGGAGAGCGCCCTCAACCACCTCCAGAACCCGGGCG
ACGCGGCCGAGGGCCGGGCGGCCAAGAGGTGCGAGAAGGCCGAGGAGAAGGCCGANCGGAAGATTTGCTGAAAAC 
TNGCATGTAGATGCTTGGTGGAGCTGTGTCCGGCTGGATTAGCAGTAAGAGCGAAGTTCAGTCGGTGAGCGCGGT 
CGGGCGGTTTTCCAGGGTCCGATGGGATTCTCTTGGCCAACCTTGGTGGTAGATGTGGCTTGACTACCCTTGGAA 
GGAAAGGCCTGGATAAGCCTAGAAGAAGCCCCTTTTTGGTTGTATCTCTTCTNGTTCTTTCCTGTCTATTNCCTA 

TCTTGCTCTTGAGCTTTTANANAGTTTAACAGGNTGTCTGCTTGGGCATAATTGGGCTCATGATGGGTATTCGAT
GACCCTTTCGAATCATGGAANGTTTTTCGGAATTTTGTTAAACGGTGGTGTGAGTAGATTTTACGGGGGAAAAGG 
TTAAATTGT 

SEQ ID NO:368 

>GL3_26KV_PCR_G3F1 Direction: N/A

ATATGCTCATCCAGATCAAAACGCAGGTGGAGGCCTCGGAGGAGAGCGCCCTCAACCACCTCCAGAACCCGGGCG 
ACGCGGCCGAGGGCCGGGCGGCCAAGAGGTGCGAGAAGGCCGAGGAGAAGGCCAAGGGAGATTGCGAAGATGGCA 
GAGATGCTGGTGGAGCTGGTCCGGCGGATAGAGAAGAGCGAGTCGTCGTGAGCGCGGTCGGCGGTTTCCAGCCAA 
TGGATTCTGGTCAACTGGTGGAGATTGGCTGACACCCTGGAGAAGCCGAAACCAGAGAGCCTTTTGTTTTCTCTT 

TTTTTTCCTGTCTATGCTCTGTCTCACTTTAACACTACGTTTTCTGCTATGGGTCCTGGTGAGATTGATGACC

SEQ ID NO:369 

>GL3_27BP_3_M13F Direction: N/A

ATATGCTCATCCAGATCAAAACGCAGGTGGAGGCCTCGGAGGAGAGCGCCCTCAACCACCTCCAGAACCCGGGCG 
ACGCGGCCGAGGGCCGGGCGGCCAAAGAGGTGCGAGAAGGCCGAGGAGAAGGCCAAGGAGATTGCGAAGATGGCA
GAGATGCTGGTGGAGCTGGTCCGGCGGATAGAGAAGAGCGAGTCGTCGTGAGCGCGGTCGGCGGTTTCCAGCCAA 
TGGATTCTGGTCAACTGGTGGAGATTGGCTGACACCCTGGAGAAGCCGAAAACCAGAGAGCCTTTGTGTTTCCTC 
TTTTTTTTCCCTGTCTATGCTCTGTCCTCACTTAACACTACGTTTCCTGCTATGGTCATGTGGTTGATGACCCTC 
AATATGAGTTTCGAATGGTAACGTGTATAGAGTAGTGGGAAGTAATTT 
GGAATTAGGGACCTAGGATTGTAAGCTCTTGCCAGGCAGGTCAACATTTTGTTCCCGGGGCTTTGGGGGGTAATT
TCTA 



SEQ ID NO:370 

>GL3_27BP_3_M13R Direction: anti-sense 

CCCCGTTGAAGTCTGGTCCTTTCTTCAGTAGTTGCTGCATGGAGTCGTCTTCCAGAATTAATGCAGCTCCTTCAG
CACCTCCAGAGCTGGAGAACTCCAACTGTGAGTCTCATGTGTCACACACAGACCCAAAGTTCCAGGGAGCTATCA 
GGTCACACAAGAAATAGCAAAGCACCTCAAAAATTTAGAAATAACCACCAAAGCCCGGGAACAAATGTGACT 
GGAAGAGCTTACAATCTAGGTCCTAATTCCTGTATGTGAGAGCATTTTCAAACAAAATTACTTCCCAAACAAAAA 
CACGTTAACAATCGAAACTCATATTGAGGTCATCAACGACAGACGATAGCAGAAAACGTAGTGTTAAGTGAGACA 

GAGGATAGACAGGAAAAAAAAGAGAAAACAAAAGGCTCTCTGGTTTCGGCTTCTCCAGGGTGTCAGCCAATCTCC
ACCAGTTGACCAGAATCCATTGGCTGGAAACCGCCGACCGCGCTCACGACGACTCGCTCTTCTCTATCCGCCGGA 
CCAGCTCCACCAGCATCTTCTGCCATCTTCGCAATCTCCTTGGCCTTCTCCTCGGCTTCTCGCACCTCTT 
SEQ ID NO:371

>GL3_8D_1_M13R Direction: anti-sense

CCCCGTTGAAGTCTGATCCTTTCTTCAGTAGTTGCTGCATGGAGTCGTCTTCCAGAATTAATGCAGCTCCTTCAG
CACCTCCAGAGCTGGAGAACTCCAACTGTGAGTCTCATGTGTCACACACAGACCCAAAGTTCCAGGGAGCTATCA 
GGTCACACAAGAAATAGCAAAGCACCTCAAAAATTTAGAAATAACCACCAAAGCCCGGGAACAAATGTGACTGCT 
GCAAGAGCTTACAATCTAGGCCCTAATTCCTGTATGTGAGAGCATTTTCAAACAAAATTACTTCCCAAACAAAAA 
CACGTTAACAATCGAAACTCATATTGAGGTCATCAACCACAGACCATAGCAGAAAACGTAGTGTTAAGTGAGACA 
GAGCATAGACAGGAAAAAAAAGAGAAAACAAAAGGCTCTCTGGTTTCGGGTTCTCCAGGGTGTCAGCCAATCTCC 
ACCAGTTGACCAGAATCCATTGGCTGGAAAC 

SEQ ID NO:372 

>GL3_8D_2_M13R Direction: anti-sense

CCCCGTTGAAGTCTGGTCCTTTCTTCAGTAGTTGCTGCATGGAGTCGTCTTCCAGAATTAATGCAGCTCCTTCAG 

CACCTCCAGAGCTGGAGAACTCCAACTGTGAGTCTCATGTGTCACACACAGACCCAAAGTTCCAGGGAGCTATCA 

GGTCACACAAGAAATAGCAAAGCACCTCAAAAATTTAGAAATAACCACCAAAGCCCGGGAACAAATGTGACTGCT 

GCAAGAGCTTACAATCTAGGCCCTAATTCCTGTATGTGAGAGCATTTTCAAACAAAATTACTTCCCAAACAAAAA 

CACGTTAACAATCGAAACTCATATTGAGGTCATCAACCACAGACCATAGCAGAAAACGTAGTGTTAAGTGAGACA 

GAGCATAGACAGGAAAAAAAAGAGAAAACAAAAGGCTCTCTGGTTTCGGCTTCTCCAGGGTGTCAGCCAATCTCC 
ACCAGTTGACCAGAATCCATTG

SEQ ID NO:373

>GL3_8D_3_M13R Direction: anti-sense

CCCCGTTGAAGTCTGGTCCTTTCTTACAGTAGTTGCTGCATGGAGTCGTCTTCCAGAATTAATGCAGCTCCTTCA 

GCACCTCCAGAGCTGGAGAACTCCAACTGTGAGTCTCATGTGTCACACACAGACCCAAAGTTCCAGGGAGCTATC 

AGGTCACACAAGAAATAGCAAAGCACCTCAAAAATTTAGAAATAACCACCAAAGCCCGGGAACAAATGTGACTGC 

TGCAAGAGCTTACAATCTAGGCCCTAATTCCTGTATGTGAGAGCATTTTCAAACAAAATTACTTCCCAAACAAAA 

ACACGTTAACAATCGAAACTCATATTGAGGTCATCAACCACAGACCATAGCAGAAAACGTAGTGTTAAGTGAGAC 

AGAGCATAGACNGGAAAAAAAAGAGAAAACAAAAGGCTCTCTGGTTTCGGCTTCTCCAGGGTGTCAGCCAATCTC 

CACCAGTTGGCCAGAATCCATTGGCTGGAAACCGCCGACCGCGCTCACGACGACTCGCTCTTCTCTATCCGCCGG 
ACCAGCTCCACCAGCATCTCTGCCATCTTC 

SEQ ID NO:374

>GL3_8D_4_M13F Direction: N/A 

AATATGCTCATCCAGATCAAAACGCAGGTGGAGGCCTCGGAGGAGAGCGCCCTCAACCACCTCCAGAACCCGGGC 

GACGCGGCCGAGGGCCGGGCGGCCAAGAGGTGCGAGAAGGCCGAGGAGAAGGCCAAGGAGATTGCGAAGATGGCA 

GAGATGCTGGTGGAGCTGGTCTGGCGGATAGAGAAGAGCGAGTCGTCGTGAGCGCGGTCGGCGGTTTCCAGCCAA 

TGGATTCTGGTCAACTGGTGGAGATTGGCTGACACCCTGGAGAAGCCGAAACCAGAGAGCCTTTTGTTTTCTCTT 

TTTTTTCCTGTCTATGCTCTGTCTCACTTAACACTACGTTTTCTGCTATGGTCTGTGGTTGATGACCTCAATATG 
AGTTTCGATGTGTTAACGT 

SEQ ID NO:375 

>GL3_8D_4_M13R Direction: anti-sense

CCCCGTTGAAGTCTGGTCCTTTCTTCAGTAGTTGCTGCATGGAGTCGTCTTTCCAGAATTAATGCAGCTCCTTCA 

GGCACCTCCAGAGCTGGAGAACTCCAACTGTGAGTCTCATGTGTCACACACAGACCCAAAGTTCCAGGGAGCTAT 

GAGGTCACACAAGAAATAGCAAAGCACCTCAAAAATTTAGAAATAACCACCAAAGCCCGGGAACAAATGTGACTG 
CTGCAAGAGCT 

SEQ ID NO:376

>GL3_8D_PCR_G3F1 Direction: N/A

ATATGCTCATCCAGATCAAAACGCAGGTTGAGGCCTCGGAGGAGAGCGCCCTCAACACCTCCAGAACCCGGGCGA 
CGCGGCGAGGCCGGGCGGGCAAAAGTGCGAGAAGGCCGAGGAGAAGGCCCAGGAGATTGCGAAGATTGCAGAGAT 
GCTGGTGGAGCTGGTCCGGCGGATAGAGAAGAGCGAGTCGTCGTGAGCGCGGTCGGCGGTTTCCAGCCCATGGAT 
TCTGGTCAACTGGTGGAGATTGGCTGACACCCTGGAGAAGCCGAAACCAGAGAGCCTTTTGTTTTCTCTTTTTTT 
TCCTGTCTATGCTCTGTCTCACTTAACACTACGTTTTCTGCTATGGTCTGTGGTTGATGACCTCAATATGAGTTT 
CGATTGTTAACGTGTTTTTGTTTGGGAAGTAATTTTGTTTGAAAATGCTCTCACATACAGGAATTA 

SEQ ID NO:377

>gi|15193293|ref|NM_033296.1| Homo sapiens T-cell activation protein
(PGR1), mRNA

TTGTCCGTGGCTTCTCTGAGAAGAAAAGTTGAAAAAGGGTAAAAGTTTTCAGGAATATTCGGGCTCTCTA 
TTGCTAAGCATAGCGAGTGTCGGTTTTCTCTCTCCAACAGACATCGCTATTGCGGTTCCGAGGCAGTGGG
AAGAGATGCGGCCCCTGGACATCGTCGAGCTGGCGGAACCGGAGGAAGTGGAGGTGCTGGAGCCCGAGGA 
GGATTTCGAGCAGTTTCTGCTCCCGGTCATCAACGAGATGCGCGAGGACATCGCGTCGCTGACGCGCGAG 
CACGGGCGGGCGTACCTGCGGAACCGGAGCAAGCTGTGGGAGATGGACAATATGCTCATCCAGATCAAAA 
CGCAGGTGGAGGCCTCGGAGGAGAGCGCCCTCAACCACCTCCAGAACCCGGGCGACGCGGCCGAGGGCCG
GGCGGCCAAGAGGTGCGAGAAGGCCGAGGAGAAGGCCAAGGAGATTGCGAAGATGGCAGAGATGCTGGTG 
GAGCTGGTCCGGCGGATAGAGAAGAGCGAGTCGTCGTGAGCGCGGTCGGCGGTTTCCAGCCAATGGATTC 
TGGTCAACTGGTGGAGATTGGCTGACACCCTGGAGAAGCCGAAACCAGAGAGCCTTTTGTTTTCTCTTTT 
TTCCTGTCTATGCTCTGTCTCACTTAACACTACGTTTTCTGCTATGGTCTGTGGTTGATGACCTCAATAT 

GAGTTTCGATTGTTAACGTGTTTTTGTTTGGGAAGTAATTTTGTTTGAAAATGCTCTCACATACAGGAAT
TAGGGCCTAGATTGTAAGCTCTTGCAGCAGTCACATTTGTTCCCGGGCTTTGGTGGTTATTTCTAAATTT 
TTGAGGTGCTTTGCTATTTCTTGTGTGACCTGATAGCTCCCTGGAACTTTGGGTCTGTGTGTGACACATG 
AGACTCACAGTTGGAGTTCTCCAGCTCTGGAGGTGCTGAAGGAGCTGCATTAATTCTGGAAGACGACTCC 
ATGCAGCAACTACTGAAGAAAGGACCAGACTTCAACGGGGAGTGTGGATGGGCCGACCTGGCTGGGACTC 

GTGAATCTGGAGAAGAGCTGGAGAATGGATAGTATTGTCTGTATTTGGAGACTTTAATTTCTGTGTGAGA
CCAAAGGAGGAGAGATGTGTTTTGTTCAAAATTTAAATTTGTTGTGGTACACTATCTTATGTAACCTGTC 
TGGTGAGTTTGTTTGGACAACCTAACTCAGCTTTATTTGACATGGAACCTAAAATAGAAGATAAGATCTT 
GATATTCTGTACAAGTTGATGTAATACCCTGATGCGTTTTAGAGGACTTGGCATAAAATGAAAGATTGGC 
AAAGGCCCTTGAGGGGCTTGGGGATGACAGTATGGAACTGTCTGCATTGGACCCTAAACTGGACTAGAAG 

AGGCATCTTCAAGGTTCATACGTTGTCCAGCTGTAAGTTCATTTGAGTAGCAGACCTAACAAATATTTGA
GGTCAGAACCCTACCATGTTAA?\ACAAACAAAAACTTACCATGTTAATAAAAGTATTCATTTGC 



SEQ ID NO:378 

>gi|15193294|ref|NP_150638.1| T-cell activation protein [Homo sapiens]
MRPLDIVELAEPEEVEVLEPEEDFEQFL

VEASEESALNHLQNPGDAAEGRAAKRCEKAEEKAKEIAKMAEMLV



SEQ ID NO:379 

Phosphoribosyl pyrophosphate synthetase-associated protein 1 

>GL3_3 8B_PCR_G3F1 Direction: N/A

CGCCCGCCCCGTTCCCCCGCCGGCCATGAACGCCGCTCGCACCGGCTACCGAGTCTTCTCGGCCAACTCCACGGC 
CGCCTGCACGGAGCTGGCCAAGCGCATCACAGAGCGCCTTGGTGCTGAATTGGGGAAGTCTGTTGTATATGAAGA 
GACCAATGGAGAAACAAGAGTTAAAATAAAAGAATCTGTTCGTGGCCAAGATATTTTCATTATACAGACAATACC 
CAGAGATGTGAATACAGCTGTGATGGAGTTGCTCATCATGGCTTACGCACTGAAGACTGCCTGTGCCAGGAACAT 

TATTGGGGTCATCCCCTACTTCCCCTACAGCAAGCAGAGCAAGATGAGAAAGAGGGGTTCCATTGTGTGCAAGCT
GCTAGCATCCATGCTGGCGAAAGCAGGTTTAACTCACATTATCACTATGGATCTTCATCAAAAGGAAATACAAGG 
CTTTTTCAGCTTTCCTGTGGACAACCTTAGAGCCTCACCTTTCCTGCTTCAGTATATCCGGAAGAAATTCCAAAT 
TACAGAAATGCAGTCATTGTAGCTAGTCTCCTGATGCTGCAAAGAGGGCCCAGTCCTATGCGGAGAGACTGCGTC 
TGGGTTTTGGCCGTCATTCACGGGGAAGCTCAGTGCACGGAACTGG 


SEQ ID NO:380

>GL3_8K_2_M13R Direction: sense 

CTGCCCGGCCCCGGTTCCCCCTGCCNGGCCATGAACGCCGCTCTGCACCGGCTACCGAGTCTTCTCGGCCAACTC 
CACGGCCGCCTGCACGGAGCTGGCCAAGCGCATCACAGAGCGCCTTGGTGCTGAATTGGGGAAGTCTGTTGTATA 
TGANAGAGACCAATGGAGGAAACCAAGAGTTACACANAGTACAAAGAANTCTGTTCGTTGGCCAAGGATATTTTC
ATTATACAGACAATTAACCCAGAGATGTGAATACAGCTGTGATGGAGTTGCTCATCATGGCTTACGCACTGAAGA 
CTGCCTGTGCCAGGAAC 



SEQ ID NO:381

>GL3_8K_3_M13R Direction: sense

CGCCCGCCCCGTTCCCCCGCCGGCCATGAACGCCGTCGCACCGGTACCGAGTCTTCTCGGCAACTCCACGGCGCC 
TGACGGAGCTGGCCAAGCGCATACAGAGCGCCTTGGTGCTGAATTGGGGAAGTCTGTTGTATATTAAGAGACCAA 
TGGGAAACAAGAGTTAAATAAAAGATCTGTTCGTGG 



SEQ ID NO:382

>GL3_8K_4_M13F Direction: anti-sense

CGCCCGCCCCNGTTTCCCCCGCCGGCCATGAACGCCGCTCGCACCCGGTACCGAGTTCTTCTCGGCCAACTCCAC 
GGCCGCCTGCACGGAGCCTTGGCCAAGCCGCATCACAGAGCGCCTTGGTGCTGAATGGGGGAAGTCTGTGGTATA 
TGAAGAGACCAATGGAGGAAACAAGAGTTAAAATAAAAGAATTCTGTTCGTGGCCAAGATTATTTTTCATTTATA 
CAGACAATTACCCAGAGATGTTGAATTACAGCTTGTT
GACTTGCCTTGTTGCCAGGAACATTTATTGGGGTCATCCCCTTACTTCCCCTACAGCAAGCAGAGCAAAGATGAG

AAAGAGGGGTTCCATTGTGTGCAAGCTGCTTAGGCAATCCATGCTGGCCGAAAAGCMGGTTTAACTCACATTATC 

ACTATGGATCTTCATCAAAAGGAAATACAAGGGCTTTTACAAGCTTTCCTTGTGGAACAAACCCTTTAGAAGCCT 

TCAACCTTTTTCCTGCTTCAGTATATCCCAGGAAGGAAATTCCAAATTACCAGAAATGCAGGTCATTTGGTAGCT 

AAGTCTCCCTGATGCCTTGCCAAGGANGGGCCAGTCCTATTCGGAGAANACATGCGTCTGGTTGGGCCGGAANTC 
ACACGGGAAACCNCATGCACAGAACTCNGGACCTTGG 

SEQ ID NO:383 

>GL3_8K_PCR_G3F1 Direction: N/A 

CGCCCGCCCCGGTTCCCCGCCGGCCATGAACGCCGCTCGCACCGGCTACCGAGTCTTCTCGGCCAACTCCACGGC 

CGCCTGCACGGAGCTGGCCAAGCGCATCACAGAGCGCCTTGGTGCTGAATTGGGGAAGTCTGTTGTATATGAAGA 

GACCAATGGAGAAACAAGAGTTAAAATAAAAGAATCTGTTCGTGGCCAAGATATTTTCATTATACAGACAATACC 

CAGAGATGTGAATACAGCTGTGATGGAGTTGCTCATCATGGCTTACGCACTGAAGACTGCCTGTGCCAGGAACAT 

TATTGGGGTCATCCCCTACTTCCCCTACAGCAAGCAGAGCAAGATGAGAAAGAGGGGTTCCATTGTGTGCAAGCT 

GCTAGCATCCATGCTGGCGAAAGCAGGTTTAACTCACATTATCACTATGATCTTCATCAAAAGGAAATACAAGGC 

TTTTTCAGCTTTCCTGTGGACAACCTTAGAGCCTCACCTTTCCTGCTTCAGTATATCCGGAAGAAATTCCAAATT 

ACAGAAATGCAGTCATTGTAGCTAACGTCTCCTGATGCTGCAAAGAGGGCCCAGTCCTATGCGGAGAGACTGCGT

CTGGGTTTGGCCGTCATTCACGGGGAAGCTCAGTGCACGGAACTGGACATTGGACGATGGTCGTCAACTCCCCGC 
TATGGTCAAAATGCTACTGTGCACCC 

SEQ ID NO:384 

>gi|4506130|ref|NM_002766.1| Homo sapiens phosphoribosyl pyrophosphate
synthetase-associated protein 1 (PRPSAP1), mRNA

GGTGCGCAAGGGCACGGACCTCGGAGCTCTCCCCGTTCCCCCGCCGGCCATGAACGCCGCTCGCACCGGC

TACCGAGTCTTCCTCGCCAACTCCACGGCCGCCTGCACGGAGCTGGCCAAGCGCATCACAGAGCGCCTTG 

GTGCTGAATTGGGGAAGTCTGTTGTATATCAAGAGACCAATGGAGAAACAAGAGTTGAAATAAAAGAATT 

TGTTCGTGGCCAAGATATTTTCATTATACAGACAATACCCAGAGATGTGAATACAGCTGTGATGGAGTTG 

CTCATCATGGCTTACGCACTGAAGACTGCCTGTGCCAGGAACATTATTGGGGTCATCCCCTACTTCCCCT 

ACAGCAAGCAGAGCAAGATGAGGAAGAGGGGTTCCATTGTGTGCAAGCTGCTAGCATCCATGCTGGCGAA 

AGCAGGTTTAACTCACATTATCACTATGGATCTTCATCAAAAGGAAATACAAGGCTTTTTCAGCTTTCCT 

GTGGACAACCTTAGAGCCTCACCTTTCCTGCTTCAGTATATCCAGGAAGAAATTCCAAATTACAGAAATG 

CAGTCATTGTAGCTAAGTCTCCTGATGCTGCAAAGAGGGCCCAGTCCTATGCGGAGAGACTGCGTCTGGG 

TTTGGCCGTCATTCACGGGGAAGCTCAGTGCACGGAACTGGACATGGACGATGGTCGTCACTCCCCGCCT 

ATGGTCAAAAATGCTACTGTGCACCCAGGCCTGGAGTTGCCATTGATGATGGCCAAAGAGAAGCCACCGA 

TAACTGTAGTTGGAGATGTTGGAGGCCGCATCGCAATCATCGTGGATGACATTATTGACGATGTGGAGAG 

TTTTGTTGCTGCCGCGGAGATCCTGAAAGAGAGAGGCGCCTATAAGATCTATGTTATGGCCACCCACGGC 

ATCCTGTCTGCAGAGGCCCCTCGCCTGATTGAGGAGTCCTCCGTAGACGAGGTGGTGGTGACGAATACTG 

TCCCTCATGAGGTTCAGAAGCTGCAATGTCCCAAGATAAAGACTGTGGATATCAGTTTGATTCTTTCTGA 

AGCCATTCGGAGAATCCACAATGGAGAGTCCATGGCCTACCTTTTCCGAAACATCACTGTGGATGACTAG 

CTTTCACGAGGGTCTCGACCCTGGACCTCCTGAGGGAAACATGGAAAAAGCAGTGCCATGAGTGATACAG 

TGTTTCCTTGCAAGGGAGGACTCGAAACAGCCTGGAGTTAGATATCTTCTTTTGCCCGGATTGATGGGGA 

GGAGGGATTAAAAGAGTCAGGAAGAAGACAGAGCTAATGGATAAATATCATAACATGGCCTTACATGTCT 

GCTGTCATCAGCCCTGTTCCTTAAAAGTTCTAGCTGCTTTCTTAAAAATAATCTGAAAATCTTATTGATA 

CTAAAGAGGAGTTAAAGGCACATAAAGTCTTAACTCTATAATGTTCATTTAGTTGTTTCAGCTCCAGGGA 

AATGGAGGTATTGATGTTGAACCTGGTTAGGGAAGCTGAGCGCCTGTGGCCCTATTACTATCCAGTTGGC 

CTCTCCGAAATCAACTTCAAGTCTTTTATAGAGAATCGTATTTTTCTTTCAGAAATTGCTATGCCTACAG 

CCATTGAAAAATGAAGCATTCATGTTGTTACATCTTCCAAGGATGTCAGATTAGAAAATAGCATCCCACC 

TCTGGGTATCTGAGTGGCTCTGAAGTTGCAAATAAAATAATTTGTTGT 

SEQ ID NO:385 

>gi|4506131|ref|NP_002757.1| phosphoribosyl pyrophosphate synthetase-associated
protein 1 [Homo sapiens]

MNAARTGYRVFLANSTAACTELAKRITERLGAELGKSVVYQETNGETRVEIKEFVRGQDIFIIQTIPRDV 
NTAVMELLIMAYALKTACARNIIGVIPYFPYSKQSKMRKRGSIVCKLIASMLAKAGLTHIITMDLHQ

QGFFSFPVDNLRASPFLLQYIQEEIPNYRNAVIVAKSPDAAKRAQSYAERLRLGLAVIHGEAQCTELDMD 

DGRHSPPMVKNATVHPGLELPLMMAKEKPPITVVGDVGGRIAIIVDDIIDDVESFVAAAEILKERGAYKI

YVMATHGILSAEAPRLIEESSVDEVVVTNTVPHEVQKLQCPKIKTVDISLILSEAIRRIHNGESMAYLFR
NITVDD 
SEQ ID NO:386

Plakoglobin 

>G3_1_90_PCR_G3F1 Direction: N/A 

GTGCGTACCATGCAGAATACCAGCGACCTGGACACAGCCCGCTGCACCACCAGCATCCTGCACAACCTCTCCCAC 
CACCGGGAGGGGCTGCTCGCCATCTTCAAGTCGGGTGGCATCCCTTCTCTGGTCCGCATGCTCAGCTCCCCTTTT
GAGTCGGTCCTGTTCTATGCCATCACCACGCTGCACAACCTGCTCCTGTACCAGGAGGGCGCCAAGATGGCCGTG 
CGCCTGGCCGACGGGCTGCAAAAGATGGTGCCCCTGCTCAACAAGAACAACCCCAAGTTCCTGGCCATCACCACC 
GACTGCCTGCAGCTCCTGGCCTACGGCAACCAGGAGAGCAAGCTGATCATCCTGGCCAATGGTGGGCCCCAGGGC 
CCTCGGTGCACGATTATGCGTTACTTCGTTTTAAAAGC 

SEQ ID NO:387 

>G3_4_01_PCR_G3F1 Direction: N/A

CCAGTGCTGTCGTTGCGTACCATGCAGAATACCAGCGACCTGGACACAGCCCGCTGCACCACCAGCATCCTGCAC 
AACCTCTCCCACCACCGGGAGGGGCTGCTCGCCATCTTCAAGTCGGGTGGCATCCCTGCTCTGGTCCGCATGCTC 
AGCTCCCCTTTTGAGTCGGTCCTGTTTCTATGCCATCACCACGCTGCACAACCTGCTCCTGTACCAGGAGGGCGC 
CAAGATGGCCGTGCGCCTGGCCGACGGGCTGCAAAAGATGGTGCCCCTGCTCAACAAGAACAACCCCAGTTCCTG 
GCCATCACCACCGACTGCCTGCAGCTCCTGGCCTACGGCAACCAGGAGAGCAAGCTGATCATCCTGGCCAATGGT 
GGGCCCCAGGCCCTCGTGCAGATCATGCGTAACTACGTTATTAAAAGCTGCTC 

SEQ ID NO:388 

>G3_4_44_PCR_G3F1 Direction: N/A 

TCCAGTGCTGTCGTGCGTACCATGCAGAATACCAGCGACCTGGACACAGCCCGCTGCACCACCAGCATCCTGCAC 

AACCTCTCCCACCACCGGGAGGGGCTGCTCGCCATCTTCAAGTCGGGTGGCATCCCTGCTCTGGTCCGCATGCTC 

AGCTCCCCTTGTGGAGTCTGTCCTGTTCTATGCCATCACCACGCTGCACAACCTGCTCCTGTACCAGGAGGGCGC 

CAAGATGGCCGTGCGCCTGGCCGACGGGCTGCAAAAGATGGTGCCCCTGCTCAACAAGAACAACCCCAAGTTCCT 

GGCCATCACCACCGACTGCCTGCAGCTCCTGGCCTACGGCAACCAGGAGAGCAAGCTGATCATCCTGGCCAATGG 

TGGGCCCCAGGCCCTCGTGCAGATCATGCGTAACTACGTTATGAAAAGCTGCTCTGGACCACCAGTCGTGTGCTC 

AAGGTGCTATCCGTGTGTCCCAGCAATAAGCCTGCCATTGTGGAGGCTGGTGGGATGCAGGCCCTGGGCAAGCAC 

CTGACCAGCAACAGCCCCCGCCTGGTGCAGAACTGCCTGTGGACCCTGCGCAACCTCTCAGATTTGGCCACCAGG 
CAGGAGGGC 

SEQ ID NO:389

>G3_4_90_PCR_G3F1 Direction: N/A 

CCAGTGCTGTCGTGCGTACCATGCAGAATACCAGCGACCTGGACACAGCCCGCTGCACCACCAGCATCCTGCACA 
ACCTCTCCCACCACCGGGAGGGGCTGCTCGCCATCTTCAAGTCGGGTGGCATCCCTGCTCTGGTCCGCATGCTCA 
GCTCCCCTGTGGGAGTCGGTCCTGTTCTATGCCATCACCACGCTGCACAACCTGCTCCTGTACCAGGAGGGCGCC 
AAGATGGCCGTGCGCCTGGCCGACGGGCTGCAAAAGATGGTGCCCCTGCTCAACAAGAACAACCCCAAGTTCCTG
GCCATCACCACCGACTGCCTGCAGCTCCTGGCCTACGGCAACCAGGAGAGCAAGCTGATCATCCTGGCCAATGGT 
GGGCCCCAGGCCCTCGTGCAGATCATGCGTAACTACAGTTATGAAAAGCTGCTCTGGACCACCAGTCGTGTGCTC 
AAGGTGCTATCCGTGTGTCCCAGCAATAAGCCTGCCATTGTGGAGGCTGGTGGGATGCAGGCCCTGGGCAAGCAC 
CTGACCAGCAACAGCCCCCGCCTGGTGCAGAACTGCCTGTGGACCCTGCGCAACCTCTCAGA 

SEQ ID NO:390

>G3_5_91_PCR_G3F1 Direction: N/A

TGCTGTCGTGCGTACCATGCAGAATACCAGCGACCTGGACACAGCCCGCTGCACCACCAGCATCCTGCACAACCT 
CTCCCACCACCGGGAGGGGCTGCTCGCCATCTTCAAGTCGGGTGGCATCCCTGCTCTGGTCCGCATGCTCAGCTC 
CCCTTGGAGTCGGTCCTGTTCTATGCCATCACCACGCTGCACAACCTGCTCCTGTACCAGGAGGGCGCCAAGATG 
GCCGTGCGCCTGGCCGACGGGCTGGAAAAGATGGTGCCCCTGCTCAACAAGAACAACCCGAAGTTCCTGGCCATC 
ACCACCGACTGCCTGCAGCTCCTGGCCTACGGCAACCAGGAGAGCAAGCTGATCATCCTGGCCAATGGTGGGCCC 
CAGGCCCTCGTGCAGATCATGCGTAACTACAGTTATGAAAAGCTGCTCTGGACCACCAGTCGTGTGCTCAAGGTG 
CTATCCGTGTGTCCCAGCAATAAGCCTGCCATTGTGGAGGCTGGTGGGATGCAGGCCCTGGGCAAGCACCTACCA 

GCAACAGCCCCCGCCTGGTGCAGAACTGCCTGTGGACCCTGCGCAACCTCTCAGATTTGGCCACCAGGCAGGAGG 
GCCTGGAGAGTGTGCTGAAAGATTCTGGTGA 

SEQ ID NO:391

>G3_7_64_PCR_G3F1 Direction: N/A 
AGCTTTTTATAACTGTAGTTACGCATGATCTC 

CTCTCCTGGTTGCCGTAGCCAGAGCTGCAGCAGTCGTGTATGCCAGAACTTGGGGTTGTTCTTGTTGAGCAGGGG 
CACCATCTTTTGCAGCCCGTCGCCAGGCGCACGGC^ 

GTGGTGATGCATAGAACAGGACCGACTCCACANGGGAGCTGAGCATGCGGACCAGAGCAGGATGCCACCCACTTG 
AAATGCAGCAGCCCCTCCCGGTGTGGAGAGGTTGTGCAGGATGCTGTGGTGCAGCGGGCTGTGTCCAGTCGCTGT

ATTCTGCATGGTACGCACGACGCGCCACCACTGGGCGAGCCCATCAGGGCCCGCCGCGACCCTCCTTTTCGACGC 
TGTTCACAATCATGCCCCTTGTCAC 

SEQ ID NO:392

>GL1_34_HIGH_1_G3F1 Direction: N/A 

GAGAGGCGCCATNGNCGGTGGCAGCGGTGGCTCCAGTGCTGTCGTGCGTACCATGCAGAATACCAGCGACCTGGA 
CACAGCCCGCTGCACCACCAGCATCCTGCACAACCTCTCCCACCACCGGGAGGGGCTGCTCGCCATCTTCAAGTC 
GGGTGGCATCCCTGNCTCTGGTCCGCATGCTCAGCTCCCCTGNTGGAGTCGGTCCTGTTCTATGCCATCACCACG 
CTGCACAACCTGCTCCTGTACCAGGAGGGCGCCAAGATGGCCGTGCGCCTGGCCGACGGGCTGCAAAAGATGGTG 
CCCCTGCTCAACAAGAACAACCCCAAGTTCCTGGCCATCACCACCGACTGCCTGCAGCTCCTGGCCTACGGCAAC 
CAGGAGAGCAAGCTGATCATCCTGGCCAATGGTGGGCCCCAGGCCCTCGTGCAGATCATGCGTAACTACAGTTAT 
GAAAAGCTGCTCTGGACCACCAGTCGTGTGCTCAAGGTGCTATCCGTGTGTCCCAGCAATAAGCCTGCCATTGTG 
GAGGCTGGTGGGATGCAGGCCCTGGGCAAGCACCTGACCAGCAACAGCCCCCGCCTGGTGCAGAACTGCCTGTGG 
ACCCTGCGCAACCTCTCAGATGTGGCCACCAGGCAGGAGGGCCTGGAG 

SEQ ID NO:393 

>GL1_34_HIGH_2_G3F1 Direction: N/A 

GNCGGTGGCAGCGGTGGCTCCAGTGCTGTCGTGCGTACCATGCAGAATACCAGCGACCTGGACACAGCCCGCTGC 
ACCACCAGCATCCTGCACAACCTCCCCCACCACCGGGAGGGGCTGCTCGCCATCTTCAAGTCGGGTGGCATCCCT 
GCTCTGGTCCGCATGCTCAGCTCCCCTGNTGGAGTCGGTCCTGTTCTATGCCATCACCACGCTGCACAACCTGCT 
CCTGTACCAGGAGGGCGCCAAGATGGCCGTGCGCCTGGCCGACGGGCTGCAAAAGATGGTGCCCCTGCTCAACAA 
GAACAACCCCAAGTTCCTGGCCATCACCACCGACTGCCTGCAGCTCCTGGCCTACGGCAACCAGGAGAGCAAGCT 
GATCATCCTGGCCAATGGTGGGCCCCAGGCCCTCGTGCAGATCATGCGTAACTACAGTTATGAAAAGCTGCTCTG 
GACCACCAGTCGTGTGCTCAAGGTGCTATCCGTGTGTCCCAGCAATAAGCCTGCCATTGTGGAGGCTGGTGGGAT 
GCAGGCCCTGGGCAAGCACCTGACCAGCAACAGCCCCCGCCTGGTGCAGAACTGCCTGTGGACCCTGCGCAACCT 
CTCAGATGTGGCCACCAAGCAGGAGGGCCTGGAGAGTGTGCTGAAGATTCTGGTGAATCAGCTGAGTGTGGATGA 
CGTCAACGTCCTCACCTGTGCCACGGGCACACTCTCCAACCTGACATGCAACAACAGCAAGAACAAGACGCTGGT 
GACACAGAACAGCGGTGTGGAGGCTCTCATCCATGCCATCCTGCGTGCTGGTGACAAGGACGACATCACGGAGCC 
TGCCGTCTGCGCTCTGCGCCACCTCACTAGCCGNCACCCTGAGGNCGAGAACGGCCTGGCCCCAGTAACGGGCCC 
CTCTNTGCAGGCTTTCCTCCTCTCTAGAANCTCCTTCTGTTGGAAGGCCCTCCG 

SEQ ID NO:394

>GL1_34_HIGH_3_G3F1 Direction: N/A 

GAGAGGCGCCATNGNCGGTGGCAGCGGTGGCTCCAGTGCTGTCGTGCGTACCATGCAGAATACCAGCGACCTGGA 

CACAGCCCGCTGCACCACCAGCATCCTGCACAACCTCTCCCACCACCGGGAGGGGCTGCTCGCCATCTTCAAGTC 

GGGTGGCATCCCTGCTCTGGTCCGCATGCTCAGCTCCCCTGATGGAGTCGGTCCTGTTCTATGCCATCACCACGC 

TGCACAACCTGCTCCTGTACCAGGAGGGCGCCAAGATGGCCGTGCGCCTGGCCGACGGGCTGCAAAAGATGGTGC 

CCCTGCTCAACAAGAACAACCCCAAGTTCCTGGCCATCACCACCGACTGCCTGCAGCTCCTGGCCTACGGCAACC 

AGGAGAGCAAGCTGATCATCCTGGCCAATGGTGGGCCCCAGGCCCTCGTGCAGATCATGCGTAACTACAGTTATG 

AAAAGCTGCTCTGGACCACCAGTCGTGTGCTCAAGGTGCTATCCGTGTGTCCCAGCAATAAGCCTGCCATTGTGG 

AGGCTGGTGGGATGCAGGCCCTGGGCAAGCACCTGACCAGCAACAGCCCCCGCCTGGTGCAGAACTGCCTGTGGA 

CCCTGCGCAACCTCTCAGATGTGGCCACCAAGCAGGAGGGCCTGGAGAGTGTGCTGAAGATTCTGGTGAATCAGC 

TGAGTGTGGATGACGTCAACGTCCTCACCTGTGCCACGGGCACACTCTCCAACCTGACATGCAACAACAGCAAGA 

ACAAGACGCTGGTGACACAGAACAGCGGTGTGGAGGCTCTCATCCATGCCATCCTGCGTGCTGGTGACAAGGACG 

ACATCACGGAGCCTGCCGTCTGCGCTCTGNGCCANCTCATTAGCCGCCACCCTGAGGGCGAGATGGNCCAGAACT 
CTGTGCGTCNCAACTATGGCATCCCA 

SEQ ID NO:395 

>GL3_1.36_A3_M13F Direction: N/A 

TGGTGGTCCAGAGCAGCTTTTTATAACTGTAGTTACGCATGATCTGCACGAGGGCCTGGGGCCCACCATTGGCCA 

GGATGATCAGCTTGCTCTCCTGGTTGCCGTAGGCCAGGAGCTGNAGGCAGTCGGTGGTGATGGCCANGAACTTGG 

GGTTGTNCTTGTTGAGCAGGGGCACCATCTTTTGCAGCCCGTCGGCCAGGCGCACGGCCATCTTGGCGCCCTCCT 

GGTACAGGAGCAGGTTGTGCAGCGTGGTGATGGCATAGAACAGGACCGACTCCACAGGGGAGCTGATCATGCGGA 

CCAGAGCAGGGATGCCACCCGACTTGAAGATGGCGAGCAGCCCCTCCCGGTGGTGGGAGAGGTTGTGCAGGATGC 

TGGTGGTGCAGCGGGCTGTGTCCAGGTCGCTGGTATTCTGCATGGTACGCACGACAGCGGCCACCAGCTGGGGCG 

AGCCCATCANGGCCCGNCGCGACGCCTCCTTGTTCGACAGCTGGTCCACAATCATGGCCGNCTTGGTCACCACCA 
CCGGGTCCTCG 
SEQ ID NO:396

>GL3_1.36_A3_M13R Direction: anti-sense

CGGNCCGACCCCGACCCGGCCCGGTCAGGCCCCATACTCAGTAGCCACGATGGAGGTGATGAACCTGATGGAGCA 
GCCTATCAAGGTGACTGANTGGCAGCAGACATACACCTACGACTCGGGTATCCACTCGGGCGCCAACACCTGCGT 
GCCCTCCGNCAGCAGCAAGGGCATCATGGAGGAGGATGAGGCCTGCGGGCGCCAGTACACGCTCAAGAAAACCAC
CACTTACACCCAGGGGGTGCCCCCCAGCCAAGGTGACCTGGAGTACCAGATGTCCACAACAGCCAGGGCCAAACG 
GGTGCGGGAGGCCATGTGCCCTGGTGTGTCAGGCGAGGACAGCTCGCTTCTGCTGGCCACCCAGGTGGAGGGGCA 
GGCCACCAACCTGCAGCGACTGGCCGAGCCGTCCCAGCTGCTCAAGTCGGCCATTGTGCATCTCATCAACTACCA 
GGACGATGCCGAGCTGGCCACTCGCGCCCTGCCCGAGCTCACCAAACTGCTCAACGACGAGGACCCGGTGGTGGT 

GACCAAGGCGGNCATGATTGTGAACCAGCTGTCGAACAAGGAGGCGTCGCGGCGGGCCCTGATGGGCTCGCCCCA
GCTGGTGGCCGCTGTCGTGCGTACCATGCAGAATACCAGCGACCTGGACACAGCCCGCTGCACCACCAGCATCCT 
GCACAACCTCTCCCACCACCGGGAGGGGCTGCTCGCCATCTTCAAGTCGGGTGGNATCCCTGCTCTGGTCCGCAT 
GCTCAGCTCCCTGTGGAGTCGGTCCTGTTCTATGCCATCACCACCTGWACAACCTGCTCCTGTACCAGGAGGGGC 
GCCAANATGGCCGTGCGCCTGGCCGACGGNTGGAAAANATGTGCCCTGCTCAACAGAACACCCCAAGTTCTGGCA 

TCACACGAAT

SEQ ID NO:397

>GL3_10.28_PCR_G3F1 Direction: N/A

NGGTGGTCCAGANCAGCTTTTTATAACTGTAGTTACGCATGATCTGCACNAGGGCCTGGGGCCCACCATATGGCC 
AGGATGATCAGCTTGCTCTCCTGGTTGCCGTAGGCCAGGAGCTGCAGGCAGTCGGTGGTGATGGCCAGGAACTTG
GGGTTGTTCTTGTTGAGCAGGGGCACCATCTTTTGCAGCCCGTCGGCCAGGCGCACGGCCATCTTGGCGCCCTCC 
TGGTACAGGAGCAGGTTGTGCAGCGTGGTGATGGCATAGAACAGGACCGACTCCACAGGGGAGCTGAGCATGCGG 
ACCAGAGCAGGGATGCCACCCGACTTGAANATGGCGAGCAGCCCCTCCCGGTGGTGGGAGAGGTTGTGCAGGATG 
CTGGTGGTGCAGCGGGCTGTGTCCAGGTCGCTGGTATTCTGCATGGTACGCACGACAGCGGCCACCAGCTGGGGC 
GAGCCCATCAGGGNCCCGCCGCGACGCCTCCTTGTTCGACAGCTGGTTCACAATCATGGCCGCCTTGGTCACCAC
CACCGGGT 

SEQ ID NO:398

>GL3_9.60_PCR_G3F1 Direction: N/A 

TGGTGGTCCAGAGCAGCTTTTTATAACTGTAGTTACGCATGATCTGCACGAGGGCCTGGGGCCCACCATTGGCCA
GGATGATCAGCTTGCTCTCCTGGTTGCCGTAGGCCAGGAGCTGCAGGCAGTCGGTGGTGATGGCCAGGAACTTGG 
GGTTGTTCTTGTTGAGCAGGGGCACCATCTTTTGCAGCCCGTCGGCCAGGCGCACGGCCATCTTGGCGCCCTCCT 
GGTACAGGAGCAGGTTGTGCAGCGTGGTGATGGCATAGAACAGGACCGACTCCACAGGGGAGCTGAGCATGCGGA 
CCAGAGCAGGGATGCCACCCGACTTGAAGATGGCGAGCAGCCCCTCCCGGTGGTGGGAGAGGTTGTGCAGGATGC 

TGGTGGTGCAGCGGGCTGTGTCCAGGTCGCTGGTATTCTGCATGGTACGCACGACAGCGGCCACCAGCTGGGGCG
AGCCCATCAGGGCCCGCCGCGACGCCTCCTTGTTCGACAGCTGGTTCACAATCATGGCCGCCTTGGTCACCACCA 
CCGGGTCCTCGTCGTTGAGCAGTTTGGTGAGCTCGGGCAGGGCGCGAGTGGCCAGCTCGGCATCGTCCTGGTAGT 
TGATGAGATGCACAATGGCCGACTTGAGCAGCTGGGACGGCTCGGCCAGTCGCTGCAGGTTGGTGGCCTGCCCCT 
CCACCTGGGTGGCCAGCAGAAGCGAGCTGTCCTCGCCTGACACACCAGGGCACATGGCCTCCCGCACCCGTTTGG 

CCCTGGCTGTTGTGGACATCTGGTACTCCAGGTCACCTTGGCTTGGGGGGCACCCCCTGGGTGTAAGTGGTGGTT
TTCTTGAGCGTGTACTGGCGCCCGCAGGCCTCATCCTCCTCCATGATGCCCTTGCTGC 

SEQ ID NO:399 

>GL3_9.66_PCR_G3F1 Direction: N/A

TGGTGGTCCAGAGCAGCTTTTTATAACTGTAGTTACGCATGATCTGCACGAGGGCCTGGGGCCCACCATTGGCCA
GGATGATCAGCTTGCTCTCCTGGTTGCCGTAGGCCAGGAGCTGCAGGCAGTCGGTGGTGATGGCCAGGAACTTGG 
GGTTGTTCTTGTTGAGCAGGGGCACCATCTTTTGCAGCCCGTCGGCCAGGCGGACGGCCATCTTGGCGCCCTCCT 
GGTACAGGAGCAGGTTGTGCAGCGTGGTGATGGCATAGAACAGGACCGACTCCACAGGGGAGCTGAGCATGCGGA 
CCAGAGCAGGGATGCCACCCGACTTGAAGATGGCGAGCAGCCCCTCCCGGTGGTGGGAGAGGTTGTGCAGGATGC 

TGGTGGTGCAGCGGGCTGTGTCCAGGTCGCTGGTATTCTGCATGGTACGCACGACAGCGGCCACCAGCTGGGGCG
AGCCCATCAGGGCCCGCCGCGACGCCTCCTTGTTCGACAGCTGGTTCACAATCATGGCCGCCTTGGTCACCACCA 
CCGGGTCCTCGTCGTTGAGCAGTTTGGTGAGCTCGGGCAGGGCGCGAGTGGCCAGCTCGGCATCGTCCTGGTAGT 
TGATGAGATGCACAATGGCCGACTTGAGCAGCTGGGACGGCTCGGCCAGTCGCTGCAGGTTGGTGGCCTGCCCCT 
CCACCTGGGTGGCCAGCAGAAGCGAGCTGTCCTCGCCTGACACACCAGGGCACATGGCCTCCCGCACCCGTTTGG 

CCCTGGCTGTTGTGGACATCTGG

SEQ ID NO:400 

>GL3_9.94_PCR_G3F1 Direction: N/A 
TGGTGGTCCAGAGCAGCTTTTTATAACTC 
GGATGATCAGCTTGCTCTCCTGGTTGCCGTAGGCCAGGAGCTGCAGGCAGTCGGTGGTGATGGCCAGGAACTTGG
GGTTGTTCTTGTTGAGCAGGGGCACCATCTTTTGCAGCCCGTCGGCCAGGCGCACGGCCATCTTGGCGCCCTCCT
GGTACAGGAGCAGGTTGTGCAGCGTGGTGATGGCATAGAACAGGACCGACTCCACAGGGGAGCTGAGCATGCGGA 
CCAGAGCAGGGATGCCACCCGACTTGAAGATGGCGAGCAGCCCCTCCCGGTGGTGGGAGAGGTTGTGCAGGATGC 
TGGTGGTGCAGCGGGCTGTGTCCAGGTCGCTGGTATTCTGCATGGTACGCACGACAGCGGCCACCAGCTGGGGCG 
AGCCCATCAGGGGCCGCCGCGACGCCCTCCTTGTTCGACAGCTGGTTCACAATCATGGCCGCCTTTGGTCACCAC 

AGTTGATGAGATGCACAA GAGCAGTTTGG ^^ 

SEQ ID NO:401 

>GL1_34_HIGHPLKS_1 Direction: N/A 

CANGAGAGCAAGCTGATCATCCTGGCCAATGGTGGGCCCCAGGCCCTCGTGCAGATCATGCGTAACTACAGTTAT 
GAAAAGCTGCTCTGGACCACCAGTCGTGTGCTCAAGGTGCTATCCGTGTGTCCCAGCAATAAGCCTGCCATTGTG 
GAGGCTGGTGGGATGCAGGCCCTGGGCAAGCACCTGACCAGCAACAGCCCCCGACCTGGTGCAGAACTGCCTGTG 
GACCCTGCGCAACCTCTCAGATGTGGCCACCAGGCAGGAGGGCCTGGAGAGTGTGCTGAAGATTCTGGTGAATCA 
GCTGAGTGTGGATGACGTCAACGTCCTCACCTGTGCCACGGGCACACTCTCCAACCTGACATGCAACAACAGCAA 
GAACAAGACGCTGGTGACACAGAACAGCGGTGTGGAGGCTCTCATCCATGCCATCCTGCGTGCTGGTGACAAGGA 
CGACATCACGGAGCCTGCCGTCTGCGCTCTGCGCCACCTCACTAGCCGCCACCCTGAGGCCGAGATGGCCCAGAA 
CTCTGTGCGTCTCAACTATGGCATCCCAGCCATCGTGAAGCTGCTCAACCAGCCCAACCAGTGGCCACTGGTCAA
GGCAACCATCGGCTTGATCAGGAATCTGGCCCTGTGCCCCAGCCCAACCATGCCCCGCTGCAGGAGGCAGNGGTC 
ATCCCCGCCTCGTCCAACTGCTGGTGAANGCCCACCAGGATGCCCAGNGCCACGTANCTGCAGGCNCACCGCAGC 

SEQ ID NO:402 

>GL1_34_HIGHPLKS_2 Direction: N/A 

CCCTGNATAATACNTTAGTGTGGATGACGTCAANGTCCTCNWCTGCTGCCACGGGGACACTCTCCAACCTGACAT 
GCAACAACAGCAAGAACAAGACGCTGGTGACACAGAACAGCGGTGTGGAGGCTCTCATCCATGCCATCCTGCGAT 
GCTGGTGACAAGGACGACATCACGGAGCCTGCCGTCTGCGCTCTGCGCCACCTCACTAGCCGCCACCCTGAGGCC 
GAGATGGCCCAGAACTCTGTGCGTCTCAACTATGGCATCCCAGCCATCGTGAAGCTGCTCAACCAGCCCAACCAG 
TGGCCACTGGTCAAGGCAACCATCGGCTTGATCAGGAATCTGGCCCTGTGCCCAGCCAACCATGCCCCGCTGCAG 
GAGGCAGCGGTCATCCCCCGCCTCGTCCAACTGCTGGTGAAGGCCCACCAGGATGCCCAGCGCCACGTAGCTGCA 
GGCACACAGCAGCCCTACACGGATGGTGTGAGGATGGAGGAGATTGTGGAGGGCTGCACCGGAGCACTGCACATC 
CTCGCCCGGGACCCCATGAACCGCATGGAGATCTTCCGGCTCAACACCATTCCCCTGTTTGTGCAGCTCCTGTAC 

GCC £™ A * GCAGANGGG ^^^ 

GCTGCTGCCCGTCCTGTTCCGCATCTCCGANGACAANAACCCAGACTACCGGAANCGCGTGTCCGTGGAGCTCAC
CAACTCCCTTCTTCANGCATGAACCCGGCTGCC 

SEQ ID NO:403 

>GL1_34_HIGHPLKS_3 Direction: N/A

GCACACTGNACTGATCNGGTATCTGGCCCNNTGCCCAGCCAACCATGCCCCGCTGCAGGAGGCAGCGGATCATCC 
CCCGCCTCGCTCCAACTGCTGGTGAAGGCCCACCAGGATGCCCAGCGCCACGCrAGCTGCAGGCACACAGCAGCC 
CTACACGGATGGTGTGAGGATGGAGGAGATTGTGGAGGGCTGCACCGGAGCACTGCACATCCTCGCCCGGGACCC 
CATGAACCGCATGGAGATCTTCCGGCTCAACACCATTCCCCTGNTTTGTGCAGCTCCTGTACTCGTCGGTGGAGA 
ACA * C ^ GCGCGTGGCTGC ^^ 

AGGGGGCCTCGGCCCCACTCATGGAGTTGCTGCACTCCCGCAACGAGGGCACTGCCACCTACGCTGCTGCCGTCC 

TGTTCCGCATCTCCGAGGACAAGAACCCAGACTACCGGAAGCGCGTGTCCGTGGAGCTCACCAACTCCCTCTTCA 
AGCA ^ GAGGCGGCTGCCTGGGAGGCTGCCCAG AGCATGA^ 

^ CAGG ^ ACCGCCCCATGTACTCCAGCGATGTGCC CCTTGAACCCGCTGGAGATGCACATGGACATGG 

TACCCCATCGACACCTACAGNGANGGCCTCAGGCCCCCGTACCCCACTGNAGACCACATGCTGNNCTACGCGGCC 

TGGCCCCAGTACGGNCCCCTCTTTGCAGGCTTTTCCTCCTCTCTAGAAACCTCCTTCTGTTGGAGGCCCTCNCAA 

AACCCANCAGNACAACCCACCACAGNGGTTACATAGATAAAGCGNCCGNTCGACTANTCTGAGGTCTGATACTCA 
CTGACTGTCNTACACAGTGAACCTATAAANTAAAA (jATACTCA 

SEQ ID NO:404 

>GL1_34_HIGHPLKS_4 Direction: sense 

AACCCTNTGAATGNCGACCCTTTGTGTGAGCTGGCCCAGGACAAGGCAGGCGGCCGACGCCATTGATGCANAGGG 
GGCCTCGGCCCCACTCATGGAGTTGCTGCACTCCCGCAACGAGGGCACTGCCACCTACGCTGCTGCCGTCCTGTT 
CCGCATCTCCGAGGACAAGAACCCAGACTACCGGAAGCGCGTGTCCGTGGAGCTCACCAACTCCCTCTTCAAGCA 
TGACCCGGCTGCCTGGGAGGCTGCCCAGAGCATGATTCCCATCAATGAGCCCTATGGAGATGACTTGGATGCCAC 
CTACCGCCCCATGTACTCCAGCGATGTGCCCCTTGACCCGCTGGAGATGCACATGGACATGGATGGAGACTACCC 
CATCGACACCTACAGCGACGGCCTCAGGCCCCCGTACCCCACTGCAGACCACATGCTGGCCTAGGCGGCCTGGCC
CCAGTACGGCCCCCTCTTTGCAGGCTTTTCCTCCTCTCTAGAACCTCCTTCTGTTGGAGGCCCT 

SEQ ID NO:405

>GL1_34_HIGH_1_G3R1 Direction: N/A

AGGGCCTCCAACAGAAGGAGGTTCTAGAGAGGAGGAAAAGCCTGCAAAGAGGGGGCCGTACTGGGGCCAGGCCGC 

CTAGGCCANCATGTGGTCTGCAGTGGGGTACGGGGGCCTGAGGCCGTCGCTGTAGGTGTCGATGGGGTAGTCTCC 

ATCCATGTCCATGTGCATCTCCAGNGGGTCAAGGGGCACATCGCTGGAGTACATGGGGCGGTAGGTGGCATCCAA 

GTCATCTCCATAGGGCTCATTGATGGGAATC 


SEQ ID NO:406

>gi|4504810|ref|NM_002230.1| Homo sapiens junction plakoglobin (JUP),
transcript variant 1, mRNA

CGCCAGAGTCCGGAGCAGCCGCCGCCCGACCGCGCCGAGCTCAGTTCGCTGTCCGCGCCGGCTCCCACCC 
CGGCCCGACCCCGACCCGGCCCGGTCAGGCCCCATACTCAGTAGCCACGATGGAGGTGATGAACCTGATG
GAGCAGCCTATCAAGGTGACTGAGTGGCAGCAGACATACACCTACGACTCGGGTATCCACTCGGGCGCCA 
ACACCTGCGTGCCCTCCGTCAGCAGCAAGGGCATCATGGAGGAGGATGAGGCCTGCGGGCGCCAGTACAC 
GCTCAAGAAAACCACCACTTACACCCAGGGGGTGCCCCCCAGCCAAGGTGACCTGGAGTACCAGATGTCC 
ACAACAGCCAGGGCCAAACGGGTGCGGGAGGCCATGTGCCCTGGTGTGTCAGGCGAGGACAGCTCGCTTC 
TGCTGGCCACCCAGGTGGAGGGGCAGGCCACCAACCTGCAGCGACTGGCCGAGCCGTCCCAGCTGCTCAA
GTCGGCCATTGTGCATCTCATCAACTACCAGGACGATGCCGAGCTGGCCACTCGCGCCCTGCCCGAGCTC 
ACCAAACTGCTCAACGACGAGGACCCGGTGGTGGTGACCAAGGCGGCCATGATTGTGAACCAGCTGTCGA 
AGAAGGAGGCGTCGCGGCGGGCCCTGATGGGCTCGCCCCAGCTGGTGGCCGCTGTCGTGCGTACCATGCA 
GAATACCAGCGACCTGGACACAGCCCGCTGCACCACCAGCATCCTGCACAACCTCTCCCACCACCGGGAG 
GGGCTGCTCGCCATCTTCAAGTCGGGTGGCATCCCTGCTCTGGTCCGCATGCTCAGCTCCCCTGTGGAGT
CGGTCCTGTTCTATGCCATCACCACGCTGCACAACCTGCTCCTGTACCAGGAGGGCGCCAAGATGGCCGT 
GCGCCTGGCCGACGGGCTGCAAAAGATGGTGCCCCTGCTCAACAAGAACAACCCCAAGTTCCTGGCCATC 
ACCACCGACTGCCTGCAGCTCCTGGCCTACGGCAACCAGGAGAGCAAGCTGATCATCCTGGCCAATGGTG 
GGCCCCAGGCCCTCGTGCAGATCATGCGTAACTACAGTTATGAAAAGCTGCTCTGGACCACCAGTCGTGT 
GCTCAAGGTGCTATCCGTGTGTCCCAGCAATAAGCCTGCCATTGTGGAGGCTGGTGGGATGCAGGCCCTG
GGCAAGCACCTGACCAGCAACAGCCCCCGCCTGGTGCAGAACTGCCTGTGGACCCTGCGCAACCTCTCAG 
ATGTGGCCACCAAGCAGGAGGGCCTGGAGAGTGTGCTGAAGATTCTGGTGAATCAGCTGAGTGTGGATGA 
CGTCAACGTCCTCACCTGTGCCACGGGCACACTCTCCAACCTGACATGCAACAACAGCAAGAACAAGACG 
CTGGTGACACAGAACAGCGGTGTGGAGGCTCTCATCCATGCCATCCTGCGTGCTGGTGACAAGGACGACA 
TCACGGAGCCTGCCGTCTGCGCTCTGCGCCACCTCACTAGCCGCCACCCTGAGGCCGAGATGGCCCAGAA
CTCTGTGCGTCTCAACTATGGCATCCCAGCCATCGTGAAGCTGCTCAACCAGCCCAACCAGTGGCCACTG 
GTCAAGGCAACCATCGGCTTGATCAGGAATCTGGCCCTGTGCCCAGCCAACCATGCCCCGCTGCAGGAGG 
CAGCGGTCATCCCCCGCCTCGTCCAACTGCTGGTGAAGGCCCACCAGGATGCCCAGCGCCACGTAGCTGC 
AGGCACACAGCAGCCCTACACGGATGGTGTGAGGATGGAGGAGATTGTGGAGGGCTGCACCGGAGCACTG 
CACATCCTCGCCCGGGACCCCATGAACCGCATGGAGATCTTCCGGCTCAACACCATTCCCCTGTTTGTGC
AGCTCCTGTACTCGTCGGTGGAGAACATCCAGCGCGTGGCTGCCGGGGTGCTGTGTGAGCTGGCCCAGGA 
CAAGGAGGCGGCCGACGCCATTGATGCAGAGGGGGCCTCGGCCCCACTCATGGAGTTGCTGCACTCCCGC 
AACGAGGGCACTGCCACCTACGCTGCTGCCGTCCTGTTCCGCATCTCCGAGGACAAGAACCCAGACTACC 
GGAAGCGCGTGTCCGTGGAGCTCACCAACTCCCTCTTCAAGCATGACCCGGCTGCCTGGGAGGCTGCCCA 
GAGCATGATTCCCATCAATGAGCCCTATGGAGATGACATGGATGCCACCTACCGCCCCATGTACTCCAGC
GATGTGCCCCTTGACCCGCTGGAGATGCACATGGACATGGATGGAGACTACCCCATCGACACCTACAGCG 
ACGGCCTCAGGCCCCCGTACCCCACTGCAGACCACATGCTGGCCTAGGCGGCCTGGCCCCAGTGACGGCC 
CCCTCTTTGCAGGCTTTTCCTCCTCTCTAGAACCTCCTTCTGTTGGAGGCCCTCCCATCTCCCCGCTGAA 
ACCTGCGCTCCTTTTTTGGGGGGATCCTTTGCTGCTGAGCTTCCCCAAGCACGGTGTGCCCTGGCCTGCC 
TTCTTCTTGTGTCTTTGGTGGGGATGGGGAGGCCTATTCCTGCTGGCCCCTTCTGGGGGTGGTGGGCAGG
TGACACGGAGTGGCTTGAGCTTCTGGGGATGCAGGTCCACCGAGCCCCTGACCCCTGTCTGTCCCCGCTC 
CCCTAACAGGTGCGGTTCCTCATCTGAGAGGCTCTCCGTGCAGGCGATGGGGCAAGACAGAAAAGTGCCT 
GAGCTGGGGAAGCCGGGGTGTAACTTCCTGCTGCACCCTGCGCCTCCAGAGGTCCTCCGTAGGGTCTTTC 
TTGGGATAGTGTTCTGCTCCTGCTTTTCTGTCCTGGGCATGGGTCCAGGGCCTGACACCCCCTCCCCGCC 
CCTGTGGCCCTGGCCACTAAAGCTTCAGACTCAAGTACCCATTCTGTTTTCCCCCAGCAACGCCCCTCCA
AACCTCCAGCCTCCCTGTCTCCAGCTGCCTGGGCCCGGAAGGGCTTTGGTTCCTTCTCTGGGTCTGATTT 
TCTCACTGAACTCCACCGACCAACTGCCCTAAGCCCCCAGGGCCTCCAGGGCCCAGGTTCGAGACCCAAA 
CCCCCAAAATCCAAAACTTCTCTTGAAAAGTTCAGGGACCGTCCAGGGGAGATGGGGAGGAGATATGGAG 
TGAGTCACCTGCTCCAGAAGATGCCAGCTTCTCTCTCCAGGGTGCTTAGTTGGCTTTGCCCACCCCTCAC 
TCCCCAGGGAGCTCCGGGGACAGCTTCCTCACACCCCTGTCCCACCCACACAGCTGCCCTAGCTGACCCC
GAGAAGTGCTCTTGGCTGACCCCTCTGGTGTGTGGTGAGGGGCTTTCTCTTCCCCTTCCTGTTTCAGACC 
CCCCCATTTCCCGCACATGGTGTGGGGGGCTGGGGGAGGTCCAAGCAGAGTGTTTTATTATTATCGCTTT
ATGTTTTTGGTTATTGGTTTTTTTGTATAGACCAAAGCAAAGAAAATAAAAATAACACAG 

SEQ ID NO:407 

>gi|4504811|ref|NP_002221.1| junction plakoglobin, isoform 1; gamma-catenin
[Homo sapiens]

MEVMNLMEQPIKVTEWQQTYTYDSGIHSGANTCVPSVSSKGIMEEDEACGRQYTLKKT 

DLEYQMSTTARAKRVREAMCPGVSGEDSSLLLATQVEGQ

TRALPELTKLLNDEDPVVVTKAAMIVNQLSKKEASRRALMGSPQLVAA
NLSHHREGLLAIFKSGGIPALVRMLSSPVESVLFYAITT

NPKFLAITTDCLQLLAYGNQESKLIILANGGPQALVQIMRNYSYEKLLWTTSRVLKVLSVCPSNKPAIVE

AGGMQALGKHLTSNSPRLVQNCLWTLRNLSDVATKQEGLESVLKILVNQLSVDDVNVLTCATGTLSNLTC

NNSKNKTLVTQNSGVEALIHAILRAGDKDDITEPAVCALRHLTSRHPEAEMAQNSVRLNYGIPAIVKLLN

QPNQWPLVKATIGLIRNLALCPANHAPLQEAAVIPRLV
EGCTGALHILARDPMNRMEIFRLNTIPLFVQLLYSSVENIQRVAAGVLCELAQDKEAADAIDAEGASAPL

MELLHSRNEGTATYAAAVLFRISEDKNPDYRKRVSVELTNSLFKHDPAAWEAAQSMIPINEPYGD

YRPMYSSDVPLDPLEMHMDMDGDYPIDTYSDGLRPPYPTADHMLA

SEQ ID NO:408 

>gi|12056467|ref|NM_021991.1| Homo sapiens junction plakoglobin (JUP),
transcript variant 2, mRNA

CGCCAGAGTCCGGAGCAGCCGCCGCCCGACCGCGCCGAGCTCAGTTCGCTGTCCGCGCCGGCTCCCACCC 
CGGCCCGACCCCGACCCGGCCCGGTCAGGCCCCATACTCAGTAGCCACGATGGAGGTGATGAACCTGATG 
GAGCAGCCTATCAAGGTGACTGAGTGGCAGCAGACATACACCTACGACTCGGGTATCCACTCGGGCGCCA 
ACACCTGCGTGCCCTCCGTCAGCAGCAAGGGCATCATGGAGGAGGATGAGGCCTGCGGGCGCCAGTACAC
GCTCAAGAAAACCACCACTTACACCCAGGGGGTGCCCCCCAGCCAAGGTGACCTGGAGTACCAGATGTCC 
ACAACAGCCAGGGCCAAACGGGTGCGGGAGGCCATGTGCCCTGGTGTGTCAGGCGAGGACAGCTCGCTTC 
TGCTGGCCACCCAGGTGGAGGGGCAGGCCACCAACCTGCAGCGACTGGCCGAGCCGTCCCAGCTGCTCAA 
GTCGGCCATTGTGCATCTCATCAACTACCAGGACGATGCCGAGCTGGCCACTCGCGCCCTGCCCGAGCTC 
ACCAAACTGCTCAACGACGAGGACCCGGTGGTGGTGACCAAGGCGGCCATGATTGTGAACCAGCTGTCGA
AGAAGGAGGCGTCGCGGCGGGCGCTGATGGGCTCGCCCCAGCTGGTGGCCGCTGTCGTGCGTACCATGCA 
GAATACCAGCGACCTGGACACAGCCCGCTGCACCACCAGCATCCTGCACAACCTCTCCCACCACCGGGAG 
GGGCTGCTCGCCATCTTCAAGTCGGGTGGCATCCCTGCTCTGGTCCGCATGCTCAGCTCCCCTGTGGAGT 
CGGTCCTGTTCTATGCCATCACCACGCTGCACAACCTGCTCCTGTACCAGGAGGGCGCCAAGATGGCCGT 
GCGCCTGGCCGACGGGCTGCAAAAGATGGTGCCCCTGCTCAACAAGAACAACCCCAAGTTCCTGGCCATC
ACCACCGACTGCCTGCAGCTCCTGGCCTACGGCAACCAGGAGAGCAAGCTGATCATCCTGGCCAATGGTG 
GGCCCCAGGCCCTCGTGCAGATCATGCGTAACTACAGTTATGAAAAGCTGCTCTGGACCACCAGTCGTGT 
GCTCAAGGTGCTATCCGTGTGTCCCAGCAATAAGCCTGCCATTGTGGAGGCTGGTGGGATGCAGGCCCTG 
GGCAAGCACCTGACCAGCAACAGCCCCCGCCTGGTGCAGAACTGCCTGTGGACCCTGCGCAACCTCTCAG 
ATGTGGCCACCAAGCAGGAGGGCCTGGAGAGTGTGCTGAAGATTCTGGTGAATCAGCTGAGTGTGGATGA
CGTCAACGTCCTCACCTGTGCCACGGGCACACTCTCCAACCTGACATGCAACAACAGCAAGAACAAGACG 
CTGGTGACACAGAACAGCGGTGTGGAGGCTCTCATCCATGCCATCCTGCGTGCTGGTGACAAGGACGACA 
TCACGGAGCCTGCCGTCTGCGCTCTGCGCCACCTCACTAGCCGCCACCCTGAGGCCGAGATGGCCCAGAA 
CTCTGTGCGTCTCAACTATGGCATCCCAGCCATCGTGAAGCTGCTCAACCAGCCCAACCAGTGGCCACTG 
GTCAAGGCAACCATCGGCTTGATCAGGAATCTGGCCCTGTGCCCAGCCAACCATGCCCCGCTGCAGGAGG
CAGCGGTCATCCCCCGCCTCGTCCAACTGCTGGTGAAGGCCCACCAGGATGCCCAGCGCCACGTAGCTGC 
AGGCACACAGCAGCCCTACACGGATGGTGTGAGGATGGAGGAGATTGTGGAGGGCTGCACCGGAGCACTG 
CACATCCTCGCCCGGGACCCCATGAACCGCATGGAGATCTTCCGGCTCAACACCATTCCCCTGTTTGTGC 
AGCTCCTGTACTCGTCGGTGGAGAACATCCAGCGCGTGGCTGCCGGGGTGCTGTGTGAGCTGGCCCAGGA 
CAAGGAGGCGGCCGACGCCATTGATGCAGAGGGGGCCTCGGCCCCACTCATGGAGTTGCTGCACTCCCGC
AACGAGGGCACTGCCACCTACGCTGCTGCCGTCCTGTTCCGCATCTCCGAGGACAAGAACCCAGACTACC 
GGAAGCGCGTGTCCGTGGAGCTCACCAACTCCCTCTTCAAGCATGACCCGGCTGCCTGGGAGGCTGCCCA 
GAGCATGATTCCCATCAATGAGCCCTATGGAGATGACATGGATGCCACCTACCGCCCCATGTACTCCAGC 
GATGTGCCCCTTGACCCGCTGGAGATGCACATGGACATGGATGGAGACTACCCCATCGACACCTACAGCG 
ACGGCCTCAGGCCCCCGTACCCCACTGCAGACCACATGCTGGCCTAGGCGGCCTGGCCCCAGTGCGGTTC
CTCATCTGAGAGGCTCTCCGTGCAGGCGATGGGGCAAGACAGAAAAGTGCCTGAGCTGGGGAAGCCGGGG 
TGTAACTTCCTGCTGCACCCTGCGCCTCCAGAGGTCCTCCGTAGGGTCTTTCTTGGGATAGTGTTCTGCT 
CCTGCTTTTCTGTCCTGGGCATGGGTCCAGGGCCTGACACCCCCTCCCCGCCCCTGTGGCCCTGGCCACT 
AAAGCTTCAGACTCAAGTACCCATTCTGTTTTCCCCCAGCAACGCCCCTCCAAACCTCCAGCCTCCCTGT 
CTCCAGCTGCCTGGGCCCGGAAGGGCTTTGGTTCCTTCTCTGGGTCTGATTTTCTCACTGAACTCCACCG
ACCAACTGCCCTAAGCCCCCAGGGCCTCCAGGGCCCAGGTTCGAGACCCAAACCCCCAAAATCCAAAACT 
TCTCTTGAAAAGTTCAGGGACCGTCCAGGGGAGATGGGGAGGAGATATGGAGTGAGTCACCTGCTCCAGA
AGATGCCAGCTTCTCTCTCCAGGGTGCTTAGTTGGCTTTGCCCACCCCTCACTCCCCAGGGAGCTCCGGG 
GACAGCTTCCTCACACCCCTGTCCCACCCACACAGCTGCCCTAGCTGACCCCGAGAAGTGCTCTTGGCTG 
ACCCCTCTGGTGTGTGGTGAGGGGCTTTCTCTTCCCCTTCCTGTTTCAGACCCCCCCATTTCCCGCACAT 
GGTGTGGGGGGCTGGGGGAGGTCCAAGCAGAGTGTTTTATTATTATCGCTTTATGTTTTTGGTTATTGGT
TTTTTTGTATAGACCAAAGCAAAGAAAATAAAAATAACACAG 

SEQ ID NO:409 

>gi|12056468|ref|NP_068831.1| junction plakoglobin, isoform 1; gamma-catenin
[Homo sapiens]

MEVMNLMEQPIKVTEWQQTYTYDSGIHSGANTCVPSVSSKGIMEEDEACGRQYTLKKTTTYTQGVPPSQG 

DLEYQMSTTARAKRVREAMCPGVSGEDSSLLLAT

TRALPELTKLLNDEDPVVVTKAAMIVNQLSKKEASRRALMGSPQLVA

NLSHHREGLLAIFKSGGIPALVRMLSSPVESVLFYAITTLHNLLLYQEGAK
NPKFLAITTDCLQLLAYGNQESKL

AGGMQALGKHLTSNSPRLVQNCLWTLRNLSDVATK

NNSKNKTLVTQNSGVEALIHAILRAGDKDDITEPAVCALRHLTSRHPEAEMAQNSVR
QPNQWPLVKATIGLIRNLALCPANHAPLQEAAVIPRL
EGCTGALHILARDPMNRMEIFRLNTIPLFVQLLYS
MELLHSRNEGTATYAAAVLFRISEDKNPDYRKRVSVELTNSLFKHDPAAWEAAQSM
YRPMYSSDVPLDPLEMHMDMDGDYPIDTYSDGLRPPYPTADHMLA 

SEQ ID NO:410

Plasminogen activator inhibitor 1 (SERPINE1; PAI-1) 

>GL3_6E_1_M13F Direction: N/A

AGGCANGNTACAAGTGTGATGGCAATGTGACTGGAACAGAAATAGTTTCTATCCAGGCACATCAATAAGCTCCCT 
GTAATGCCCCGTAGATTCCATCCTGCAAAAGTGGCCCTCAGTTGGCGACACCCACGTGTCTGCATGATCCCCGCA 
GTTGTGTGCTACGCACGTAGGACGAGCGTTGGAATGCAAGGTTGCCCCANATCGGTCCCGCANGGCTTCACCGTC 
TGGTTTGGATGACCTATCTAAGGGGAAGTTTCGTGCTTCTAAACGATTCTTCTTCTCCAAAGGGTCTCAAGGGGT 

CGGTAGACTATGGGAGCGATTTTGACCTTAAAAAGGTTCNCTTCCCAAGNGAGAGAAGGGCCCAACGGGAGTCCT
TGGGACGAAGCGAGGGAGTCCTGTGCTTTTACTCAACATCTGTCTTAATCAATAGAGTGNCAAGATGCTCTATCG 
TGCTTTCCCCTTCAGAGAAGTGAGGTTCAATTGTAACAGCATAAC 

SEQ ID NO:411

>GL3_3 7G_PCR_G3F1 Direction: N/A

TCGGAGTTTCTTCTTTCCCGATGCATCTCCAGTTTTGTCCCAGATGAAGGCGTCTTTCCCCAGGGTCAGGGGTTC 
CATCACTTGGCCCATGAAAAGGACTGTTCCTGTGGGGTTGTGCCGGAACAACAAAGAGGAAGGGTCTGTCCATGA 
TGATCTCCTCGGGGGCCATGCGGGCTGAGACTATGACAGGTGTGGATGAGGAGGCCACCGTGCCACTCTCGTTCA 
CCTCGATCTTCACTTTCTGCAGCGCCTGCGCGACGTGAGAGGCTCTTGGTCTGAAAGACTCGTGAAGTCAGCCTG 

AAACTGTCTGAACATGTCGGTCATTCCCAGGTTCTCTAGGGGCTTCCTGAGGTCGACTTCAGTCTCCAGGGAGAA
CTTGGGCAGAACCAGGAGGCGGGGCAGCCTGGTCATGTTGCCTTTCCAGTGGCTGATGAGCTGGGCACTCAGAAT 
GTTGGTGAGGGCAGAGAGAGGCACCTCTTTTTCATAAGGGGCAGCAATGAACATGCTGAGGGTGTCCCCGTGGTA 
GGGCAGTTCCAGGATGTCGTATAATGGCCATCGGGCGTGGTGAAC 

SEQ ID NO:412

>GL3_6E_1_M13R Direction: anti-sense

GACACACACAAAAGGTATGATCAGCAACTTGCTTGGGAAAGGAGCCGTGGACCAGCTGACACGGCTGGTGCTGGT 
GAATGCCCTCTACTTCAACGGCCAGTGGAAGACTCCCTTCCCCGACTCCAGCACCGACCGCCGCCTCTTCCACAA 
ATCAGACGGCAGCACTGTCTCTGTGCCCATGATGGCTCAGACCAACAAGTTCAACTATACTGAGTTCACCACGCC 
CGATGGCCATTACTACGACATCCTGGAACTGCCCTACCACGGGGACACCCTCAGCATGTTCATTGCTGCCCCTTA
TGAAAAAGAGGTGCCTCTCTCTGCCCTCACCAACATTCTGAGTGCCCAGCTCCATCAAGCCACTGGAAAGGCAAC 
ATGACCAGGGGC 

SEQ ID NO:413

>GL3_6E_2_M13F Direction: anti-sense

GACAGACACAAAAGGTATGATCAGCAACTTGCTTGGGAAAGGAGCCGTGGACCAGCTGACACGGCTGGTGCTGGT 
GAATGCCCTCTACTTCAACGGCCAGTGGAAGACTCCCTTCCCCGACTCCAGCACCCACCGCCGCCTCTTCCACAA 
ATCAGACGGCAGGACTGTCTCTGTGCCCATGATGGCTCAGACCAACAAGTTCAACTATACTGAGTTCACCACGCC 
CGATGGCCATTACTACGACATCCTGGAACTGCCCTACCACGGGGACACCCTCAGCATGTTCATTGCTGCCCCTTA 

TGAAAAAGAGGTGCCTCTCTCTGCCCTC^CCAACATTCTC
GACCAGGCTGCCCCGCCTCCTGGTTCTGCCCAAGTTCTCCCTGGAGACTGAAGTCGACCTCAGGAAGCCCCTAGA

GAACCTGGGAATGACCGACATGTTCAGACTCGTTTCAGGCTGACTTCACGAGTCTTTCAGACCAAGAGCCTCTCC 
ACGTCGCGCAGGCCTGAG 

SEQ ID NO:414

>GL3_6E_3_M13F Direction: anti-sense 

GACACACACAAAAGGTATGATCAGCAACTTGCTTGGGAAAGGAGCCGTGGACCAGCTGACACGGCTGGTGCTGGT 
GAATGCCCTCTACTTCAACGGCCAGTGGAAGACTCCCTTCCCCGACTCCAGCACCCACCGCCGCCTCTTCCACAA 
ATCAGACGGCAGCACTGTCTCTGTGCCCATGATGGCTCAGACCAACAAGTTCAACTATACTGAGGTTCACCACGC 
CCGATGGCCATTACTACGACATCCCTGGAACTGCCCTACCACGGGGACACCCTCAGGATGTTCATTGCTGCCCCT 



SEQ ID NO:415

>GL3_36B_PCR_G3F1 Direction: N/A

AGGCAGTACAAGTGTGATGGCAATGTGACTGGAACAGAAATAGTTTCTACCAGGCACACAAAAGCTCCTGTAAGC 
CCCGTAGTTCCATCCTGCAAAGGGCCTCAGTGGGAACCAGTCTTGGCAACCCATGGGCAGAGAGACGGGTGGAAG 
CAGGTGCCCCAGATGGTCCCGCAGGCGTCACCGTCTGGTTTGGAGACCTTAAGGGAGTTGTGCTTCAAACTTCTC 
TCCCAGGGGTCTCAGGTGGAGACTAGGGAGTTTGACCTAAAGGTCCTCCAAGGAGAGGCCAAGGTCTTGGAGACA 
GATCTGGTTTACCATCTTTTAACAAAAGGCAAATGTCTTCTCTTCCTTCAGAAAGAGTCATTAACACTAAAATTC 
TTTTCTTCGGAGTTTCTTCTTTCCCGATGCATCTCCAGTTTTGTCCCAGATGAAGGCGTCTTTCCCCAGGGGTCA 
GGGGTTCCATCACTTGGCCCATGAAAAGGACTGTTCCTGTGGGGTTGTGCCGGACCACAAAGAGG 

SEQ ID NO:416

>GL3_6E_2_M13R Direction: sense

AGGCAGTACAAGTGTGATGGCAATGTGACTGGAACAGAAATAGTTTCTACCAGGCACACAAAAGCTCCTGTAAGC 
CCCGTAGTTCCATCCTGCAAAGGGGCCTCAGTGGGAACCAGGTCTGCAGACCCGAGTGGGCAGAGAGACGGGTGG 
AAGCAGGTGCCCCAGATGGTCCCGCAGGCGTCACCGTCTGGTTTGGAGACCTTAAGGGAGTTGTGCTTCAAACTT 
CTCTCCCAGGGTCTCAGGTGGAGACTAGGGAGTTTGACCTAAAGGTCCTCCAAGGAGAGGCCAAGGTCTTGGAGA 
CAGATCTGGTTTACCATCTTTTAACAAAAGGCAAATGTCTTCTCTTCCTTCAGAAAGAGTCATTAACACTAAAAT 
TCTTTTCTTCGGAGTTTCTTCTTTCCCGATGCATCTCCAGTTTTG 

SEQ ID NO:417

>GL3_6E_3_M13R Direction: sense

CAGCTGGANCGAGCTCGGATCACTTANTAACGGCCGCCAGTGTGCTGGAATTCGCCCTTCTCTCGGCATGGACGA 
GCTGTACAAGGAGGAGGCGCCAAGGCCGGTGGCAGCGGTGGCTCCAGTGTGCTGGGTTCTTGCGGAGGCAGTACA 
AGTGTGATGGCAATGTGACTGGAACAGAAATAGTTTCTACCAGGCACACAAAAGCTCCTGTAAGCCCCGTAGTTC 
CATCCTGCAAAGGGGCCTCAGTGGGAACCAGGTCTGCAGACCCGAGTGGGCAGAGAGACGGGTGGAAGCAGGTGC 
CCCAGATGGTCCCGCAGGCGTCACCGTCTGGTTTGGAGACCTTAAGGGAGTTGTGCTTCAAACTTCTCTCCCAGG 
GTCTCAGGTGGAGACTAGGGAGTTTGACCTAAAGGTCCTCCAAGGAGAGGCCAAGGTCTTGGAGACAGATCTGGT 
TTACCATCTTTTAACAAAAGGCAAATGTCTTCTCTTCCTTCAGAAAGAGTCATTAACACTAAAATTCTTTTCT 

SEQ ID NO:418

>GL3_6E_PCR_G3F1 Direction: N/A 

AGGCAGTACAAGTGTGATGGCAATGTGACTGGAACAGT^AATAGTTTCTACCAGGCACACAAAAGCTCCTGTAAGC 

CCCGTAGTTCCATCCTGCAAAGGGCCTCAGTGGGAACCAGGTCTGCAGACCCAGTGGGCAGAGAGACGGGTGGAA 

GCAGGTGCCCCAGATGGTCCCGCAGGCGTCACCGTCTGGTTTGGAGACCTTAAGGGAGTTGTGCTTCAAACTTCT 

CTCCCAGGGTCTCAGGTGGAGACTAGGGAGTTTGACCTAAAGGTCCTCCAAGGAGAGGCCAAGGTCTTGGAGACA 

GATCTGGTTTACCATCTTTTAACAAAAGGCAAATGTCTTCTCTTCCTTCAGAAAGAGTCATTAACACTAAAATTC 

TTTTCTTCGGAGTTTCTTCTTTCCCGATGCATCTCCCGTTTTGTCCCAGATAAGGCGTCTTTCCCCAGGGTCAGG 
GGTTCCATCACTTGGCCCATGAAAAGGACTGTTCCT 

SEQ ID NO:419 

>gi|10835158|ref|NM_000602.1| Homo sapiens serine (or cysteine) proteinase
inhibitor, clade E (nexin, plasminogen activator inhibitor type 1), member
1 (SERPINE1), mRNA

GAATTCCTGCAGCTCAGCAGCCGCCGCCAGAGCAGGACGAACCGCCAATCGCAAGGCACCTCTGAGAACT 
TCAGGATGCAGATGTCTCCAGCCCTCACCTGCCTAGTCCTGGGCCTGGCCCTTGTCTTTGGTGAAGGGTC 
TGCTGTGCACCATCCCCCATCCTACGTGGCCCACCTGGCCTCAGACTTCGGGGTGAGGGTGTTTCAGCAG 
GTGGCGCAGGCCTCCAAGGACCGCAACGTGGTTTTCTCACCCTATGGGGTGGCCTCGGTGTTGGCCATGC 
TCCAGCTGACAACAGGAGGAGAAACCCAGCAGCAGATTCAAGCAGCTATGGGATTCAAGATTGATGACAA 
GGGCATGGCCCCCGCCCTCCGGCATCTGTACAAGGAGCTCATGGGGCCATGGAACAAGGATGAGATCAGC

ACCACAGACGCGATCTTCGTCCAGCGGGATCTGAAGCTGGTCCAGGGCTTCATGCCCCACTTCTTCAGGC 

TGTTCCGGAGCACGGTCAAGCAAGTGGACTTTTCAGAGGTGGAGAGAGCCAGATTCATCATCAATGACTG 

GGTGAAGACACACACAAAAGGTATGATCAGCAACTTGCTTGGGAAAGGAGCCGTGGACCAGCTGACACGG 

CTGGTGCTGGTGAATGCCCTCTACTTCAACGGCCAGTGGAAGACTCCCTTCCCCGACTCCAGCACCCACC 

GCCGCCTCTTCCACAAATCAGACGGCAGCACTGTCTCTGTGCCCATGATGGCTCAGACCAACAAGTTCAA

CTATACTGAGTTCACCACGCCCGATGGCCATTACTACGACATCCTGGAACTGCCCTACCACGGGGACACC 

CTCAGCATGTTCATTGCTGCCCCTTATGAAAAAGAGGTGCCTCTCTCTGCCCTCACCAACATTCTGAGTG 

CCCAGCTCATCAGCCACTGGAAAGGCAACATGACCAGGCTGCCCCGCCTCCTGGTTCTGCCCAAGTTCTC 

CCTGGAGACTGAAGTCGACCTCAGGAAGCCCCTAGAGAACCTGGGAATGACCGACATGTTCAGACAGTTT 

CAGGCTGACTTCACGAGTCTTTCAGACCAAGAGCCTCTCCACGTCGCGCAGGCGCTGCAGAAAGTGAAGA 

TCGAGGTGAACGAGAGTGGCACGGTGGCCTCCTCATCCACAGCTGTCATAGTCTCAGCCCGCATGGCCCC 

CGAGGAGATCATCATGGACAGACCCTTCCTCTTTGTGGTCCGGCACAACCCCACAGGAACAGTCCTTTTC 

ATGGGCCAAGTGATGGAACCCTGACCCTGGGGAAAGACGCCTTCATCTGGGACAAAACTGGAGATGCATC 

GGGAAAGAAGAAACTCCGAAGAAAAGAATTTTAGTGTTAATGACTCTTTCTGAAGGAAGAGAAGACATTT 

GCCTTTTGTTAAAAGATGGTAAACCAGATCTGTCTCCAAGACCTTGGCCTCTCCTTGGAGGACCTTTAGG 

TCAAACTCCCTAGTCTCCACCTGAGACCCTGGGAGAGAAGTTTGAAGCACAACTCCCTTAAGGTCTCCAA 

ACCAGACGGTGACGCCTGCGGGACCATCTGGGGCACCTGCTTCCACCCGTCTCTCTGCCCACTCGGGTCT 

GCAGACCTGGTTCCCACTGAGGCCCTTTGCAGGATGGAACTACGGGGCTTACAGGAGCTTTTGTGTGCCT 

GGTAGAAACTATTTCTGTTCCAGTCACATTGCCATCACTCTTGTACTGCCTGCCACCGCGGAGGAGGCTG 

GTGACAGGCCAAAGGCCAGTGGAAGAAACACCCTTTCATCTCAGAGTCCACTGTGGCACTGGCCACCCCT 

CCCCAGTACAGGGGTGCTGCAGGTGGCAGAGTGAATGTCCCCCATCATGTGGCCCAACTCTCCTGGCCTG 

GCCATCTCCCTCCCCAGAAACAGTGTGCATGGGTTATTTTGGAGTGTAGGTGACTTGTTTACTCATTGAA 

GCAGATTTCTGCTTCCTTTTATTTTTATAGGAATAGAGGAAGAAATGTCAGATGCGTGCCCAGCTCTTCA 

CCCCCCAATCTCTTGGTGGGGAGGGGTGTACCTAAATATTTATCATATCCTTGCCCTTGAGTGCTTGTTA 

GAGAGAAAGAGAACTACTAAGGAAAATAATATTATTTAAACTCGCTCCTAGTGTTTCTTTGTGGTCTGTG 

TCACCGTATCTCAGGAAGTCCAGCCACTTGACTGGCACACACCCCTCCGGACATCCAGCGTGACGGAGCC 

CACACTGCCACCTTGTGGCCGCCTGAGACCCTCGCGCCCCCCGCGCCCCCCGCGCCCCTCTTTTTCCCCT 

TGATGGAAATTGACCATACAATTTCATCCTCCTTCAGGGGATCAAAAGGACGGAGTGGGGGGACAGAGAC 

TCAGATGAGGACAGAGTGGTTTCCAATGTGTTCAATAGATTTAGGAGCAGAAATGCAAGGGGCTGCATGA 

CCTACCAGGACAGAACTTTCCCCAATTACAGGGTGACTCACAGCCGCATTGGTGACTCACTTCAATGTGT 

CATTTCCGGCTGCTGTGTGTGAGCAGTGGACACGTGAGGGGGGGGTGGGTGAGAGAGACAGGCAGCTCGG 

ATTCAACTACCTTAGATAATATTTCTGAAAACCTACCAGCCAGAGGGTAGGGCACAAAGATGGATGTAAT 

GCACTTTGGGAGGCCAAGGCGGGAGGATTGCTTGAGCCCAGGAGTTCAAGACCAGCCTGGGCAACATACC 

AAGACCCCCGTCTCTTTAAAAATATATATATTTTAAATATACTTAAATATATATTTCTAATATCTTTAAA 

TATATATATATATTTTAAAGACCAATTTATGGGAGAATTGCACACAGATGTGAAATGAATGTAATCTAAT 
AGAAGC 

SEQ ID NO:420 

>gi|10835159|ref|NP_000593.1| serine (or cysteine) proteinase inhibitor,
clade E (nexin, plasminogen activator inhibitor type 1), member 1;
plasminogen activator inhibitor, type I [Homo sapiens]

MQMS PALTCLVLGLALVFGEGS AVHHPPS YVAH^ PYGVASVLAMLQ 
LTTGGETQQQIQAAMGFKIDDKGMAPALRHL^ 

RSTVKQVDFSEVERARFIINDWVKTHTKGMISNLLGKGAVDQLTRLVLVNALYFNGQWKTPFPDSSTHRR
LFHKSDGSTVSVPMMAQTNKFNYTEFTTPDGHYYDILELPYHGDTLSMFIAAPYEKEVPL
LISHWKGNMTRLPRLLVLPKFSLETEVDLRKPLENLGM
VNESGTVASSSTAVIVSARMAPEEIIMDRPFLFVVRHNPTGTVLFMGQ

SEQ ID NO:421

Proteasomal subunit Y

ACACTGCCCTCCTTCTTGAGGGTCCCAGCCTGCGATGATGATTCCCGCCATCAGGTCTTCCCGGTATCGGTAACA 
CATCTCCTTAAAGAGACTGGCTGCTGTGTGGACCAGTGGAGGCTCATTCAGTTCAATGCTGTGGAAACCGAGCTG 
GTAGGTGACAGCATCAGCTACTGCCTGGGTATCAGCAGCTGAGCCTGAGCGACAGCAGAAAATGCGGTCGTGAAT 
AGGTGTCAGCTTGTCAGTCACTCGATTGGCGATGTAGGACCCAGTGGTTGTTCTGGAGTCCGCCCCCAGAACCAC 
GCCCCCGTCAAACTGCACGGCCATGATAGTGGTCCCAGTGGAAACTTCTCGGCTTTCCCAGTCTGGAGTGAACGC 
CTCCGGCCCCCAAGCCGGTGCTGGCCCGGCTCCCCGAGCAGCTAGTAAGGTAG 
SEQ ID NO:422

Rap2B 

>GL3_37F_1_M13F Direction: N/A 

CGAACTGCTCGGTGCCCGCCGTATCCAGGATCTCCAGCACCGACGGCGACGAGTCCACCTCAATCTCCTTGCGGT 
AAAAGTCTTCGATGGTCGGGTCGTACTTCTCGATGAAGGAGCCCGTCACGAACTGCACGGTGAGCGCGGACTTGC
CCACGCCGCCCGAGCCCAGCACCACCACTTTGTACTCTCTCATGGCTCCGTCAGCGCTCTCGCCGCGCCTGCCGC 
GGCCCCGTCGGGGCTGCGCGCGGGAAAGGCTGGGCTTGGCGGGCTGGACTTCTCTTCCCCTTCCTCAGCTACGCA 
GGAAAAACCCAGGAGAGAGCAACCCAGAGAACGCAGGGCGGAAACCACCGAAACGGAAGGCGGGCCGCCAGGCCC 
GGCCCGGCGGCCGGCGGGCAAGGCGNTAATCCCTGGGGGCTAGCACGGGGTCCCGGGGGCCCACGGAAGGCAAGG 
TCACCGGNAGGCAGGCAAGGACANGACAGCCCCNGGNGGACGGGCAGAAAACCCGGGAANGAGGGGACAAATTGG
CCCACGGCAAGCCTTTGGCGNCACGNGAAGAAAAACCACAAGGGCAACAAGGGGGGNGTTAACGNCATTAAGGAA 
TAACACAGCGCGGAGCCGAGAG 

SEQ ID NO:423 

>GL3_37F_4_M13F Direction: anti-sense

CGCAGCTGCGGGCATTGTCCTCTCGGTTCGCCGCCCGGGCTGCTGCTGCCGCCGCGGACTGCTGCGGGGCCCGGA 
CCCGCACCCCAGGGATACGCTGCCGCCGTCGCCGGCCGGCCCGGGCCCGGCCTCCGTTCGGTGGTTTCCGCCCTG 
CGTTCTCTGGGTTGCTCTCTCCTGGGTTTTTCCTGCGTAGCTGAGGAAGGGGAAGAGAAGTCCAGCCGCCAAGCC 
CAGCCTTCCCCGGCGCAGCCCCGACGGGGCCGCCGGACAGGCGCGGGCGGAGAAGCGCTGACGGAGCCATGAGGA 

GAGTACACAAGTGGTGGTGCTAGGGCTCGAGGCAGGAGTGGGGCAAAGTCCGCAGACTTCAACCGTGGCAAGATA
CGGTTGGAACACGGGGCATCCTTCAATCGNAGAAAAGGTACGAACCCGAACNCATCGAAAGAACTTTTTAACCGG 
CCAAGGNAGNAATGGAGAGTGGGACCTCAGGNCTACGNCCCGTCCCGGGANGCTTGGAAGAATCC

SEQ ID NO:424 

>GL3_37F_2_M13R Direction: sense

CGAACTGCTCGGTGCCCGCCGTATCCAGGATCTCCAGCACCGACGGCGACGAGTCCACCTCAATCTCCTTGCGGT 
AAAAGTCTTCGATGGTCGGGTCGTACTTCTCGATGAAGGAGCCCGTCACGAACTGCACGGGTGAGCAGCGGACTT 
GCCCANAGACCGGACCGAGCCCAGACACACACCACTATATGTACTCATCTCAATGGCATCCGNCAGAGCTCATCG 
NCCGACAGACCTGCCGNNGGACCCCGATACAGGGGGGCATTGCAGCGCGGNGGAAAAGGGCATNGGGGNCATAGG 

ACAGGGACAATAGGGAACAGNTATCATCATTCACACACATATCGCATCAAAGCNATACGCAAAGGGAAAAACAA^
AACGCCCAAGAGAAGAAAGAGGCAAAACCCCCAAGGAANNGAAAACAACAGGGCCAACAAGAGGGGACCGGGGAG 
GAAAAAAA 

SEQ ID NO:425

>GL3_37F_3_M13F Direction: N/A

CGAACTGCTCGGTGCCCGCCGTATCCAGGATCTCCAGCACCGACGGCGACGAGTCCACCTCAATCTCCTTGCGGT 
AAAAGTCTTCGATGGTCGGGTCGTACTTCTCGATGAAGGAGCCCGTCACGAACTGCACGGTGAGCGCGGACTTGC 
CCACGCCGCCCGAGCCCAGCACCACCACTTTGTACTCTCTCATGGCTCCGTCAGCGCTCTCGCCGCGCCTGCCGC 
GGCCCCGTCGGGGCTGCGCGCGGGGAAGGCTGGGCTTGGCAGGCTGGACTTCTCTTCCCCTTCCTCAGCTACGCA 

GGAAAAACCCAGGAGAGAGCAACCCAGAGAACGCAGGGCGGGAAACCACCGAACGGAGGCCGGGCGCCGGGCCGT
CCGGCCGGACGANGGCAGTGCGGGGGCGAACCGAGAGGAACAAATGCCCGAGAGGCCCTTGCGGCCGAAAAA 

SEQ ID NO:426 

>GL3_37F_4_M13R Direction: sense

CGAACTGCTCGGTGCCCGCCGTATCCAGGATCTCCAGCACCGACGGCGACGAGTCCACCTCAATCTCCTTGCGGT
AAAAGTCTTCGATGGTCGGGTCGTACTTCTCGATGAAGGAGCCCGTCACGAACTGCACGGTGAGCGCGGACTTGC 
CCACGCCGCCCGAGCCCAGCACCACCACTTTGTACTCTCTCATGGCTCCGTCAGCGCTCTCGCCGCGCCTGCCGN 
GGCCCCGTCGGGGCTGCGCGCGGGGAAGGCTGGGCTTGGCGGCTGGACTTCTCTTCCCCTTCCTCAGNTACGCAG 
GAAAAACCCAGGAGGAGAGCAACCCAGGAGAACGCAGGGCGGAAAGCACGCGGAACGGAGTGCCGGGCGCCGGGC 

CGGCCCG

SEQ ID NO:427 

>GL3_37F_PCR_G3F1 Direction: sense

CGAACTGCTCGGTGCCCGCCGTATCCAGGATCTCCAGCACCGACGGCGACGAGTCCACCTCAATCTCCTTGCGGT 
AAAAGTCTTCGATGGTCGGGTCGTACTTCTCGATGAAGGAGCCCGTCACGAACTGCACGGTGAGCGCGGACTTGC
CCACGCCGCCCGAGCCCAGCACCACCACTTTGTACTCTCTCATGGCTCCGTCAGCGCTCTCGCCGCGCCTGCCGC 
GGCCCCGTCGGGGCTGCGCGCGGGGAAGGCTGGGCTTGGCGGCTGGACTTCTCTTCCCCTTCCTCAGCTACGCAG 
GAAAAACCCAGGAGAGAGCAACCCAGAGAACGCAGGGCGGAAACCACCGAACGGAGGCCGGGCGCCGGGCCGGCC 
GGCGGCGGCGGCAGCGTATCCCTGGGGTGCGGGTCCGGGCCCCGCAGCAGTCCGCGGCGGCAGCAGCAGCCCGGG 
CGGCGAACCGAGAGGACAATGCCCGCAGCCTGCGCCGCAGTVACCCAGCACAGTGGTTAGATAGATAAAGCGGCGC
TCGACTAGTCTGAGGTCTGATACTCACTGACTGAA 

SEQ ID NO:428 

>gi|16159529|ref|XM_003032.2| Homo sapiens RAP2B, member of RAS oncogene
family (RAP2B), mRNA

GCACGTTCCGCGGGGACTCATGCCACGCGCGTCCCGGCCCGACGCGCAATTAGCAGCCACCTCCGCAGCC 
CGCCGCCACCGCCTCCCTGCCCTCCCGGGCTGCCGCAGCTAGGAGCTCCAGCCGTCGCCTCGCGCAGGCT 
GCGGGCATTGTCCTCTCGGTTCGCCGCCCGGGCTGCTGCTGCCGCCGCGGACTGCTGCGGGGCCCGGACC 

CGCACCCCAGGGATACGCTGCCGCCGCCGCCGGCCGGCCCGGCGCCCGGCCTCCGTTCGGTGGTTTCCGC 
CCTGCGTTCTCTGGGTTGCTCTCTCCTGGGTTTTTCCTGCGTAGCTGAGGAAGGGGAAGAGAAGTCCAGC 
CGCCAAGCCCAGCCTTCCCCGGCGCGCAGCCCCGACGGGGCCGCGGCAGGCGCGGCGAGAGCGCTGACGG 
AGCCATGAGAGAGTACAAAGTGGTGGTGCTGGGCTCGGGCGGCGTGGGCAAGTCCGCGCTCACCGTGCAG 
TTCGTGACGGGCTCCTTCATCGAGAAGTACGACCCGACCATCGAAGACTTTTACCGCAAGGAGATTGAGG 

TGGACTCGTCGCCGTCGGTGCTGGAGATCCTGGATACGGCGGGCACCGAGCAGTTCGCGTCCATGCGGGA
CCTGTACATCAAGAACGGCCAGGGCTTCATCCTGGTCTACAGCCTCGTCAACCAGCAGAGCTTCCAGGAC 
ATCAAGCCCATGCGGGACCAGATCATCCGCGTGAAGCGGTACGAGCGCGTGCCCATGATCCTGGTGGGCA 
ACAAGGTGGACCTGGAGGGTGAGCGCGAGGTCTCGTACGGGGAGGGCAAGGCCCTGGCTGAGGAGTGGAG 
CTGCCCCTTCATGGAGACGTCGGCCAAAAACAAAGCCTCGGTAGACGAGCTATTTGCCGAGATCGTGCGG 

CAGATGAACTACGCGGCGCAGCCCAACGGCGATGAGGGCTGCTGCTCGGCCTGCGTGATCCTCTGAGGCG
GCCACCGCGCGCCGGCCGCGCTCTGCGCACAAAAGCCAAACGCATCCGACTCTCTAAATGTGATTTATTT 
CTTGCTTTGAGATTGGAGACCACTTTGCATTGGCCAGGGTGTCTTGGGAGCCCGGCTGGCCTCCGCGGCC 
GGCGTCCCCTGCCTCCACCCTGTGCCCGAGGGGGTGTCCGGTCCTGCCCATCCGATACTCTGGTGGAAAT 
GTGGCTCTTTGCAGCATGTACGTTTCTCCCTGATTTTGGTTGATGCATATTTCCCCGTTTAAGTAGCCGT 

TAGGGCGCAGTATCGGCAGCTTGACACCCACCAAGCAAAAGTTTCAGCCTGGAAAAAAAATGGGGGGGAA
GGGTGGATGAAAAGGAGGGAGAGAAGGTGGAAATGGTTTTTTTTTTTTTTTTTCTATTTTCTTTCTTTTT 
TTTTTTTTTTTTTTTTGGTCAACAGCCGTTTTTCTAGTTCCAAGTTTTAAATACATGGAAGGAAGTCCGG 
GAGAACCATATGAAGGAGCAGGAGGAGAGGAAGAAACTTTTTTTCCTTCTTTTCCAGGAGTAGCTGGAAA 
TTAAGATCGGGTTCCTTTTCTGCCAGCTTGGAAGGGCAACCCCATGACTGATTGCGATTCTGAGGATGTC 

TATGCAAAGTTGGATTCTTGTTACAGTGTATCCAATCTGAAGTATTGCACATCTGAACTGGGACTGTTAA
CACTGATGCCAATACAGTGTGGGGTGCCAGAAAGTGTCTGCTGATATTTGTGG 

SEQ ID NO:429 

>gi|11433346|ref|XP_003032.1| RAP2B, member of RAS oncogene family [Homo sapiens]

MREYKVVVLGSGGVGKSALTVQFVTGSFIEKYDPTIEDFYRKEIEVDSSPSVLEILDTAGTEQFASMRDL

YIKNGQGFILVYSLVNQQSFQDIKPMRDQIIRVKRYERVPMILW 

PFMETSAKNKASVDELFAEIVRQMNYAAQPNGDEGCCSACVIL 

SEQ ID NO:430

Semaphorin 3F

>GL3_10.20_A_M13F Direction: anti-sense

CGGCAGAAAACAGAGCCTGCCTAACCAGGCCCAGCCAGTTGGTGGGGCCAGGCCAGGACCACACAGTCCCCAGAC 
TCAGCTGGAAGTCTACCTGCTGGACAGCCTCCGCCAAGATCTACAGGACAAAGGGAGGGAGCAAGCCCTACTCGG 

ATGGGGCACGGACTGTCCACCTTTTCTGATGTGTGTTGTCAGCCTGTGCTGTGGCATAGACATGGATGCGAGGAC
CACTTTGGAGACTGGGGTGGCCTCAAGAGC^CACAGAGAAGGGAAGAAGGGGCCATCACAGGATGCC^ 
CCTGGGTTGGGGGCACTCAGCCACGACCAGCCCCTTCCTGGGTATTTATTCTCTATTTATTGGGGATAGGAGAAG 
AGGCATCCTGCCTGGGTGGGACAGGCCCTTCAGCCCCTTCTCCNCTCCCCGCCTGGCCAGGGCAGGGCCACCCCA 
CTCTACCTCCTTAGCTTTCCCTGTGCCACTTTGACTCAGAGGCTGGGAGCATAGCAGAGGGGCCAGGCCCAGGCA 

GAGCTGACGGGAGGCCCCAGCTCTGAGGGGAGGGGGTC

SEQ ID NO:431

>GL3_9.67_A_M13R Direction: anti-sense

CGGCAGCAAAACAGANCCTGCCTAACCAGGACCCANCCAGTTGGTGGGGCCAGGCCAGGACCACACAGTCCCCAG
ACTCAGCTGGAAGTCTACCTGCTGGACAGCCTCCGCCAAGATCTACAGGACAAAGGGAGGGAGCAAGTCCTACTC
GGATGGGGCACGGACTGTCCACCTTTTCTGATGTGTGTTGTCAGCCTGTGCTGTGGCATAGACATGGATGCGAGG
ACCACTTTGGAGACTGGGGTGGCCTCAAGAGCACACAGAGAAGGGAAGAAGGGGCCATCACAGGATGCCAGCCCC
TGCCTGGGTTGGGGGCACTCAGCCACGACCAGCCCCTTCCTGGGTATTTATTCTCTATTTATTGGGGATAGGAGA
AGAGGCATCCTGCCTGGGTGGGACAGCCCCTTCAGCCCCTTCTCCNCTCCCCGACTGGCCAGGGCAGGGCCACCC
CACTCTACCTCCTNANATTCCCCTGTGC

SEQ ID NO:432

>GL3_9.67_B_M13F Direction: N/A 

GCCAGTGTTTATTAGCAAGATGGAACCCAAAGGCGGCTGTGGCCTGGGCAGCAGAAGGCCACCAGGAGCCCCCAC 
CCATCTACCCAACTGCCCCCCAGAGCTAATTACATCCACACCCATCCCCTGAAGTGGTGGACATAAGGGAGCCCT
GGGGAGCCTCTACCGGCCCCAGGCCTCTACCACGGACCCCCTCCCCTCAGAGCTGGGGCCTCCCGTCAGCTCTGC 
CTGGGCCTGGCCCCTCTGCTATGCTCCCAGCCTCTGAGTCAAAGTGGCACAGGGAAAGCTAAGGAGGTAGAGTGG 
GGTGGCCCTGCCCTGGCCAGGCGGGGAGGGGAGAAGGGGCTGAAGGGGCTGTCCCACCCAGGCAGGATGCCTCTT 
CTCCTATCCCCAATAAATAGAGAATAAATACCCAGGAAGGGGCTGGTCGTGGCTGAGTGCCCCCAACCCAGGCAG 
GGGCTGGCATCCTGTGATGGCCCCTTCTTCCCTTCTCTGTGTGCTCTTGAGGCCACCCCAGTCTCCAAAGTGGTC 
CTCGCATCCATGTCTATGCCACAGCACAGGCTGACAACACACATCANAAAAGGTGGACAGTCCGTGCCCCATCCG 
AGTAGGGCTTGCTCCCTCCCTTTGTCCTGTAGATCTTGGCGGANGCTGTCCAGCAGGTANACTTCCAGCTGANTC 
TGGGGACTGTGTGGTCCTGGCCTGGCCCACCAACTGGNTGGCCTGGTTAGGCAGGTCTGTTTTCTTGC 

SEQ ID NO:433

>GL3_9.67_B_M13R Direction: anti-sense

CGGCAGAAAACAGAGCCTGCCTAACCAGGCCCAGCCAGTTGGTGGGGCCAGGCCAGGACCACACAGTCCCCAGAC
TCAGCTGGAAGTCTACCTGCTGGACAGCCTCCGCCAAGATCTACAGGACAAAGGGAGGGAGCAAGCCCTACTCGG 
ATGGGGCACGGACTGTCCACCTTTTCTGATGTGTGTTGTCAGCCTGTGCTGTGGCATAGACATGGATGCGAGGAC 
CACTTTGGAGACTGGGGTGGCCTCAAGAGCACACAGAGAAGGGAAGAAGGGGCCATCACAGGATGCCAGCCCCTG 
CCTGGGTTGGGGGCACTCAGCCACGACCAGCCCCTTCCTGGGTATTTATTCTCTATTTATTGGGGATAGGAGAAG 
AGGCATCCTGCCTGGGTGGGACAGCCCCTTCAGCCCCTTCTCCNCTCCCCGCCTGGCCAGGGCAGGGCCACCCCA 
CTCTACCTCCTTAGCTTTCCCTGTGCCACTTTGACTCAGAGGCTGGGAGCATAGCAGAGGGGCCAGGCCCAGGCA 
GAGCTGACGGGAGGCCCCAGCTCTGAGGGGAGGGGGTCCGTGGTAGAGGCCTGGGGCCGGTAGAGGCTCCCCAGG 
GCTCCCTTATGTCCACCACTTCAGGGGATGGGTGTG 

SEQ ID NO:434 

>GL3_9.67_C_M13F Direction: N/A 

GCCAGTGTTTATTAGCAAGATGGAACCCAAAGGCGGCTGTGGCCTGGGCAGCAGAAGGCCACCAGGAGCCCCCAC 
CCATCTACCCAACTGCCCCCCAGAGCTAATTACATCCACACCCATCCCCTGAAGTGGTGGACATAAGGGAGCCCT 
GGGGAGCCTCTACCGGCCCCAGGCCTCTACCACGGACCCCCTCCCCTCAGAGCTGGGGCCTCCCGTCAGCTCTGC 
CTGGGCCTGGCCCCTCTGCTATGCTCCCAGCCTCTGAGTCAAAGTGGCACAGGGAAAGCTAAGGAGGTAGAGTGG 
GGTGGCCCTGCCCTGGCCAGGCGGGGAGGGGAGAAGGGGCTGAAGGGGCTGTCCCACCCAGGCAGGATGCCTCTT 
CTCCTATCCCCAATAAATAGAGAATAAATACCCAGGAAGGGGCTGGTCGTGGCTGAGTGCCCCCAACCCAGGCAG 
GGGCTGGCATCCTGTGATGGCCCCTTCTTCCCTTCTCTGTGTGCTCTTGAGGNCACCCCAGTCTCCAAAGTGGTC 
CTCGCATCCATGTCTATGCCACAGCACAGGCTGACAACACACATCANAAAAGGTGGACAGTCCGTGCCCCATCCG 
ANTAGGGCTTGCTCCCTCCCTTTGTCCTGTAAATC 

SEQ ID NO:435

>GL3_9.67_D_M13R Direction: sense

GCCAGTGTTTATTAGCAAGATGGAACCCAAAGGCGGCTGTGGCCTGGGCAGCAGAAGGCCACCAGGAGCCCCCAC 
CCATCTACCCAACTGCCCCCCAGAGCTAATTAC^TCC^CACCCATCCCCTGAAGTGGTGGACATAAGGGAGCCCT 
GGGGAGCCTCTACCGGCCCCAGGCCTCTACCACGGACCCCCTCCCCTCAGAGCTGGGGCCTCCCGTCAGCTCTGC 
CTGGGCCTGGCCCCTCTGCTATGCTCCCAGCCTCTGAGTCAAAGTGGCACAGGGAAAGCTAANGAGGTAGAGTGG 
GGTGGCCCTGCCCTGGCCAGGCGGGGAGGGGAGAAGGGGCTGAAGGGGCTGTCCCACCCAGGCAGGATGCCTCTT 
CTCCTATCCCCAATAAATAGAGAATAAATACCCAGNAAGGGGCTGGTCGTGGCTGAGTGCCCCCAACCCANGCAG 
GGGCTGGCATCCTGTGATGGCCCCTTCTTCCCTTCTCTGTGTGCTCTTGAGGCCACCCCAGTCTCCAAAGTGGTC 
CTCGCATCCATGTCTATGCCACAGCACAGGCTGACAACACACATCANAAAAAGGTGGACAG 

SEQ ID NO:436 

>GL3_9.67_PCR_G3F1 Direction: N/A

GCCAGTGTTTATTAGCAAGATGGAACCCAAAGGCGGCTGTGGCCTGGGCAGCAGAAGGCCACCAGGAGCCCCCAC 
CCATCTACCCAACTGCCCCCCAGAGCTAATTACATCCACACCCATCCCCTGAAGTGGTGGACATAAGGGAGCCCT 
GGGGAGCCTCTACCGGCCCCAGGCCTCTACCACGGACCCCCTCCCCTCAGAGCTGGGGCCTCCCGTCAGCTCTGC 
CTGGGCCTGGCCCCTCTGCTATGCTCCCAGCCTCTGAGTCAAAGTGGCACAGGGAAAGCTAAGGAGGTAGAGTGG 
GGTGGCCCTGCCCTGGCCAGGCGGGGAGGGGAGAAGGGGCTGAAGGGGCTGTCCCACCCAGGCAGGATGCCTCTT 
CTCCTATCCCCAATAAATAGAGAATAAATACCCAGGAAGGGGCTGGTCGTGGCTGAGTGCCCCCAACCCAGGCAG 
GGGCTGGCATCCTGTGATGGCCCCTTCTTCCCTTCTCTGTGTGCTCTTGAGGCCACCCCAGTCTCCAAAGTGGTC 
CTCGCATCCATGTCTATGCCACAGCACAGGCTGACAACACACATCAGAAAAGGTGGACAGTCCGTGCCCCATCCG 
AGTAGGGCTTGCTCCCTCCCTTTTGTCCTGTAGATCTTGGCGGAGGCTGTCCAGCAGGTAGACTTCCAGCTGAGT
CTGGGGACTGTGTGGTCCTGGCCTGGCCCCACCAACTGGCTGGGCCTGGTTAGGCAGGCTCTGTTTTCTGCCG 

SEQ ID NO:437 

>gi|1061350|gb|U38276.1|HSU38276 Human semaphorin III family homolog mRNA,
complete cds

CTGACTGGTGCTCCCTCTCTTTCATCTTGGGCTGTCTGCATGTGTCTCATTCCCCCACTCTCTCCTGTGC 
CTCCCCTCTACTCTAATAATCAGGTCCAGGTTTCTCTGTACTGGGAGAAGACCTGTGGCTGGAGCAGGCA 
GGGATGCACCCTATCTGTTCCCCATTCCTCCAGGTGGGAGGGAGAAGGAGTAACCCACTTTATTGGCCAC 
AGATGCAGGGGAGAAAGGAGAAAGCATGCTGGGAGCTGGAAAGAGCCCTAAGATCACCTGGTTTCTAGAG
AGTGGAGCCTGCTTCCTGCCTAGGCCCCTCCCACAATGCTTGTCGCCGGTCTTCTTCTCTGGGCTTCCCT 
ACTGACTGGGGCCTGGCCATCCTTCCCTACCCAGGACCACCTCCCGGCCACGCCCCGGGTACGGCTCTCA 
TTCAAAGAGCTGAAGGCCACAGGCACCGCCCACTTCTTCAACTTCCTGCTCAACACAACCGACTACCGAA 
TCTTGCTCAAGGACGAGGACCACGACCGCATGTACGTGGGCAGCAAGGACTACGTGCTGTCCCTGGACCT 

GCACGACATCAACCGCGAGCCCCTCATTATACACTGGGCAGCCTCCCCACAGCGCATCGAGGAATGCGTG
CTCTCAGGCAAGGATGTCAACGGCGAGTGTGGGAACTTCGTCAGGCTCATCCAGCCCTGGAACCGAACAC 
ACCTGTATGTGTGCGGGACAGGTGCCTACAACCCCATGTGCACCTATGTGAACCGCGGACGCCGCGCCCA 
GGATTACATCTTCTACCTGGAGCCTGAGCGACTCGAGTCAGGGAAGGGCAAGTGTCCGTACGATCCCAAG 
CTGGACACAGCATCGGCCCTCATCAATGAGGAGCTCTATGCTGGTGTGTACATCGATTTTATGGGCACTG 

ATGCAGCCATCTTCCGCACACTTGGAAAGCAGACAGCCATGCGCACGGATCAGTACAACTCCCGGTGGCT
GAACGACCCGTCGTTCATCCATGCTGAGCTCATTCCTGACAGTGCGGAGAATGATGATAAGCTTTACTTC 
TTCTTCCGTGAGCGGTCGGCAGAGGCGCCGCAGAGCCCCGCGGTGTACGCCCGCATCGGGCGCATTTGCC 
TGAACGATGACGGTGGTCACTGTTGCCTGGTCAACAAGTGGAGCACATTCCTGAAGGCGCGGCTCGTCTG 
CTCTGTCCCGGGCGAGGATGGCATTGAGACTCACTTTGATGAGCTCCAGGACGTGTTTGTCCAGCAGACC 

CAGGACGTGAGGAACCCTGTCATTTACGCTGTCTTTACCTCCTCTGGCTCCGTGTTCCGAGGCTCTGCCG
TGTGTGTCTACTCCATGGCTGATATTCGCATGGTCTTCAACGGGCCCTTTGCCCACAAAGAGGGGCCCAA 
CTACCAGTGGATGCCCTTCTCAGGGAAGATGCCCTACCCACGGCCGGGCACGTGCCCTGGTGGAACCTTC 
ACGCCATCTATGAAGTCCACCAAGGATTATCCTGATGAGGTGATCAACTTCATGCGCAGCCACCCACTCA 
TGTACCAGGCCGTGTACCCTCTGCAGCGGCGGCCCCTGGTAGTCCGCACAGGTGCTCCCTACCGCCTTAC 

CACTATTGCCGTGGACCAGGTGGATTCAGCCGACGGGCGCTATGAGGTGCTTTTCCTGGGCACAGACCGC
GGGACAGTGCAGAAGGTCATTGTGCTGCCCAAGGATGACCAGGAGATGGAGGAGCTCATGCTGGAGGAGG 
TGGAGGTCTTCAAGGATCCAGCACCCGTCAAGACCATGACCATCTCTTCTAAGAGGCAACAACTCTACGT 
GGCGTCAGCCGTGGGTGTCACACACCTGAGCCTGCACCGCTGCCAGGCGTATGGGGCTGCCTGTGCTGAC 
TGCTGCCTTGCCCGGGACCCTTACTGTGCCTGGGATGGCCAGGCCTGCTCCCGCTATACAGCATCCTCCA 

AGAGGCGGAGCCGCCGGCAGGACGTCCGGCACGGAAACCCCATCAGGCAGTGCCGTGGGTTCAACTCCAA
TGCCAACAAGAATGCCGTGGAGTCTGTGCAGTATGGCGTGGCCGGCAGCGCAGCCTTCCTTGAGTGCCAG 
CCCCGCTCGCCCCAAGCCACTGTTAAGTGGCTGTTCCAGCGAGATCCTGGTGACCGGCGCCGAGAGATTC 
GTGCAGAGGACCGCTTCCTGCGCACAGAGCAGGGCTTGTTGCTCCGTGCACTGCAGCTCAGCGATCGTGG 
CCTCTACTCCTGCACAGCCACTGAGAACAACTTTAAGCACGTCGTCACACGAGTGCAGCTGCATGTACTG 

GGCCGGGACGCCGTCCATGCTGCCCTCTTCCCACCACTGTCCATGAGCGCCCCGCCACCCCCAGGCGCAG
GCCCCCCAACGCCTCCTTACCAGGAGTTAGCCCAGCTGCTGGCCCAGCCAGAAGTGGGCCTCATCCACCA 
GTACTGCCAGGGTTACTGGCGCCATGTGCCCCCCAGCCCCAGGGAGGCTCCAGGGGCACCCCGGTCTCCT 
GAGCCCCAGGACCAGAAAAAGCCCCGGAACCGCCGGCACCACCCTCCGGACACATGAGGCCAGCTGCCTG 
TTCTGCCATGGGCAGCCTAGCCCTTGTCCCTTTTAATATAAAAGACATATATATATATATATATATATAT 

ATATAAAAATATCTATATTCTATACACACCCTGCCCCTGCAAAGACAGTATTTATTGGTGGGTTGAATAT
AGCCTGCCTCAGTGGCAGCATCCTCCAAAACTTAGACCCATGCTGGTCAGAGACGGCAGAAAACAGAGCC 
TGCCTAACCAGGCCAGCCAGTTAGTGGGGCCAGGCCAGGACCACACAGTCCCCAGACTCAGCTGGAGTCT 
ACCTGCTGGACAGGCCTTGCGCCAAGATCTACAGGACAAAGGGAGGGAGCAAGCCCTACTCGGATGGGGC 
ACGGACCGTCCACCTTTCTGATGTGTGTTGTCAGCCTGTGCTGTGGCATAGACATGGATGCGAGGACCAC 

TTTGGAGACTGGGGTGGCCTCAAGAGCACACAGAGAAGGGAAGAAGGGGCCATCACAGGATGCCACCCCT
GCCTTGGGTTGGGGGCACTCTGCCACGACCAGCCCCTTCCTGGGTATTTATTCTCTATTTATTGGGGATA 
GGAGAAGAGGCATCCTGCCTGGGTGGGACAGCCCCTTGGGCCCCTTCTCCCCTCCCCGCCTGGCCAGGGA 
GGGCCACCCCACTCTACCTCCTTAGCTTTCCCTGTGCCACTTTGACTCAGAGGCTGGGAGCATAGCAGAG

GCCAGGCCCAGGCAGAGCTGACGGGAGGCCCCAGCTCTGAGGGGAGGGGGTCCGTGGTAGAGCCTGGGGC 
AGGTAGAGGCTCCCCAGGGCTCCCTTATGTCCACCACTTCAGGGGATGGGTGTGGATGTAATTAGCTCTG
GGGGGCAGTTGGGTAGATGGGTGGGGGCTCCTGGTGGCCTTCTGCTGCCCAGGCCACAGCCGCCTTTGGG 
TTCCATCTTGCTAATAAACACTGGCTTTGGAACTAAAAAAAAAAAAAAAAAA 

SEQ ID NO:438

>gi|1061351|gb|AAB18276.1| semaphorin III family homolog

MLVAGLLLWASLLTGAWPSFPTQDHLPATPRVRLSFKELKATGTAHFFNFLLNTTDYRILLKDEDHDRMY
VGSKDYVLSLDLHDINREPLIIHWAASPQRIEECVLSGKDVNGECGNFVRLIQPWNRTHLYVCGTGAYNP
MCTYVNRGRRAQDYIFYLEPERLESGKGKCPYDPKLDTASALINEELYAGVYIDFMGTDAAIFRTLGKQT
AMRTDQYNSRWLNDPSFIHAELIPDSAENDDKLYFFFRERSAEAPQSPAVYARIGRICLNDDGGHCCLVN 
KWSTFLKARLVCSVPGEDGIETHFDELQDVFVQQTQDVRNPVIYAVFTSSGSVFRGSAVCVYSMADIRMV 
FNGPFAHKEGPNYQWMPFSGKMPYPRPGTCPGGTFTPSMKSTKDYPDEVINFMRSHPLMYQAVYPLQRRP 
LVVRTGAPYRLTTIAVDQVDSADGRYEVLFLGTDRGTVQKVIVLPKDDQEMEELMLEEVEVFKDPAPVKT
MTISSKRQQLYVASAVGVTHLSLHRCQAYGAACADCCLARDPYCAWDGQACSRYTASSKRRSRRQDVRHG 
NPIRQCRGFNSNANKNAVESVQYGVAGSAAFLECQPRSPQATVKWLFQRDPGDRRREIRAEDRFLRTEQG
LLLRALQLSDRGLYSCTATENNFKHVVTRVQLHVLGRDAVHAALFPPLSMSAPPPPGAGPPTPPYQELAQ
LLAQPEVGLIHQYCQGYWRHVPPSPREAPGAPRSPEPQDQKKPRNRRHHPPDT 

SEQ ID NO:439 

SPARC 

>gi|4507170|ref|NM_003118.1| Homo sapiens secreted protein, acidic
cysteine-rich (osteonectin) (SPARC), mRNA

CGGGAGAGCGCGCTCTGCCTGCCGCCTGCCTGCCTGCCACTGAGGGTTCCCAGCACCATGAGGGCCTGGA 

TCTTCTTTCTCCTTTGCCTGGCCGGGAGGGCCTTGGCAGCCCCTCAGCAAGAAGCCCTGCCTGATGAGAC 

AGAGGTGGTGGAAGAAACTGTGGCAGAGGTGACTGAGGTATCTGTGGGAGCTAATCCTGTCCAGGTGGAA 

GTAGGAGAATTTGATGATGGTGCAGAGGAAACCGAAGAGGAGGTGGTGGCGGAAAATCCCTGCCAGAACC

ACCACTGCAAACACGGCAAGGTGTGCGAGCTGGATGAGAACAACACCCCCATGTGCGTGTGCCAGGACCC 

CACCAGCTGCCCAGCCCCCATTGGCGAGTTTGAGAAGGTGTGCAGCAATGACAACAAGACCTTCGACTCT 

TCCTGCCACTTCTTTGCCACAAAGTGCACCCTGGAGGGCACCAAGAAGGGCCACAAGCTCCACCTGGACT 

ACATCGGGCCTTGCAAATACATCCCCCCTTGCCTGGACTCTGAGCTGACCGAATTCCCCCTGCGCATGCG 

GGACTGGCTCAAGAACGTCCTGGTCACCCTGTATGAGAGGGATGAGGACAACAACCTTCTGACTGAGAAG 

CAGAAGCTGCGGGTGAAGAAGATCCATGAGAATGAGAAGCGCCTGGAGGCAGGAGACCACCCCGTGGAGC 

TGCTGGCCCGGGACTTCGAGAAGAACTATAACATGTACATCTTCCCTGTACACTGGCAGTTCGGCCAGCT 

GGACCAGCACCCCATTGACGGGTACCTCTCCCACACCGAGCTGGCTCCACTGCGTGCTCCCCTCATCCCC 

ATGGAGCATTGCACCACCCGCTTTTTCGAGACCTGTGACCTGGACAATGACAAGTACATCGCCCTGGATG 

AGTGGGCCGGCTGCTTCGGCATCAAGCAGAAGGATATCGACAAGGATCTTGTGATCTAAATCCACTCCTT 

CCACAGTACCGGATTCTCTCTTTAACCCTCCCCTTCGTGTTTCCCCCAATGTTTAAAATGTTTGGATGGT 

TTGTTGTTCTGCCTGGAGACAAGGTGCTAACATAGATTTAAGTGAATACATTAACGGTGCTAAAAATGAA 

AATTCTAACCCAAGACATGACATTCTTAGCTGTAACTTAACTATTAAGGCCTTTTCCACACGCATTAATA 

GTCCCATTTTTCTCTTGCCATTTGTAGCTTTGCCCATTGTCTTATTGGCACATGGGTGGACACGGATCTG 

CTGGGCTCTGCCTTAAACACACATTGCAGCTTCAACTTTTCTCTTTAGTGTTCTGTTTGAAACTAATACT 

TACCGAGTCAGACTTTGTGTTCATTTCATTTCAGGGTCTTGGCTGCCTGTGGGCTTCCCCAGGTGGCCTG 

GAGGTGGGCAAAGGGAAGTAACAGACACACGATGTTGTCAAGGATGGTTTTGGGACTAGAGGCTCAGTGG 

TGGGAGAGATCCCTGCAGAATCCACCAACCAGAACGTGGTTTGCCTGAGGCTGTAACTGAGAGAAAGATT 

CTGGGGCTGTCTTATGAAAATATAGACATTCTCACATAAGCCCAGTTCATCACCATTTCCTCCTTTACCT 

TTCAGTGCAGTTTCTTTTCACATTAGGCTGTTGGTTCAAACTTTTGGGAGCACGGACTGTCAGTTCTCTG 

GGAAGTGGTCAGCGCATCCTGCAGGGCTTCTCCTCCTCTGTCTTTTGGAGAACCAGGGCTCTTCTCAGGG 

GCTCTAGGGACTGCCAGGCTGTTTCAGCCAGGAAGGCCAAAATCAAGAGTGAGATGTAGAAAGTTGTAAA 

ATAGAAAAAGTGGAGTTGGTGAATCGGTTGTTCTTTCCTCACATTTGGATGATTGTCATAAGGTTTTTAG 

CATGTTCCTCCTTTTCTTCACCCTCCCCTTTGTTCTTCTATTAATCAAGAGAAACTTCAAAGTTAATGGG 

ATGGTCGGATCTCACAGGCTGAGAACTCGTTCACCTCCAAGCATTTCATGAAAAAGCTGCTTCTTATTAA 

TCATACAAACTCTCACCATGATGTGAAGAGTTTCACAAATCTTTCAAAATAAAAAGTAATGACTTAGAAA 
CTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO:440 

>gi|4507171|ref|NP_003109.1| secreted protein, acidic, cysteine-rich
(osteonectin); Osteonectin (secreted protein, acidic, cysteine-rich) [Homo sapiens]

MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEVSVGANPVQVEVGEFDDGAEETEEEVVAEN
PCQNHHCKHGKVCELDENNTPMCVCQDPTSCPAPIGEFEKVCSNDNKTFDSSCHFFATKCTLEGTKKGHK
LHLDYIGPCKYIPPCLDSELTEFPLRMRDWLKNVLVTLYERDEDNNLLTEKQKLRVKKIHENEKRLEAGD

SEQ ID NO:441

ssDNA binding protein- 1 

>GL1_4__1_G3F1 Direction: sense 

CTGGCAGACCCTGGCCCAATCCTAACAGTGCTAACTCAATTCCATACTCCTCCTCATCACCTGGCTACCTATGTG 
GGACCCCCTGGNTGGTGGCGGTCCTCCAGGAAGACCCATTATGCCCAGTCCCGCAGATTCAACAAATTCCAGTGA
CAACATCTACACAATGATTAATCCAGTGCCGCCTGGAGGCAGCCGGTCCAACTTCCCGATGGGTCCCGGCTCGAA 
CGGTCCGATGGGCGGCATGGGTGGCATGGAGCCACACCACATGAATGGATCATTAGGGTCAGGCGACATAGACGG 
NCTTCCAAAAAATTCTCCTAACAACATAAGTGGCATTAGCAATCCTCCAGGCACCCCTCGAGATGACGGCGAG 

SEQ ID NO:442

>GL1_4_1_G3R1 Direction: anti-sense

CTCGCCGTCATCTCGAGGGGTGCCTGGAGGATTGCTAATGCCACTTATGTTGTTAGGAGAATTTTTTGGAAGTCC 
GTCTATGTCGCCTGACCCTAATGATCCATTCATGTGGTGTGGCTCCATGCCACCCATGCCGCCCATCGGACCGTT 
CGAGCCGGGACCCATCGGGAAGTTGGACCGGCTGCCTCCAGGCGGCACTGGATTAATCATTGTGTAGATGTTGTC 
ACTGGAATTTGTTGAATCTGCGGGACTGGGCATAATGGGTGTTCCTGGAGGACCGCCACCACCAGGGGGTCCCAC
ATAGGTACCAGGTGATGAGGAGGAGTATGGAATTGAGTTAGCACTGTTAGGATTGGGCCAGGGTCTGCCAG 

SEQ ID NO:443 

>GL1_4_2_G3F1 Direction: sense 
CTGGCAGACCCTGGCCCAATCCTAACAGTGCTAACTCAATTCCATACTCCTCCTCATCACCTGGTACCTATGTGG
GACCCCCTGGCTGGTGGCGGTCCTCCAGGAACACCCATTATGCCCAGTCCCGCAGATTCAACAAATTCCAGTGAC
AACATCTACACAATGATTAATCCAGTGCCGCCTGGAGGCAGCCGGTCCAACTTCCCGATGGGTCCCGGCTCGAAC 
GGTCCGATGGGCGGCATGGGTGGCATGGAGCCACACCACATGAATGGATCATTAGGGTCAGGCGACATAGACGGA 
CTTCCAAAT^AATTCTCCTAACAACATAAGTGGCATTAGCAATCCTCCAGGCACCCCTCGAGATGACGGCGAG 

SEQ ID NO:444 

>GL1_4_2_G3R1 Direction: anti-sense

CTCGNCGATCATCTCNAGGGGTGCCTGTAAGGATNGCATAAGTGCCACTTATGTTGTNAGGAGAATTTTTTGGAA 
GTCCGTCTATGTCGCCTGACCCTAATGATCCATTCATGTGGTGTGGCTCCATGCCACCCATGCCGCCCATCGGAC 
CGTTCGAGCCGGGACCCATCGGGAAGTTGGACCGGCTGCCTCCAGGCGGCACTGGATTAATCATTGTGTAGATGT
TGTCACTGGAATTTGTTGAATCTGCGGGACTGGGCATAATGGGTGTTCCTGGAGGACCGCCACCACCAGGGGGTC 
CCACATAGGTACCAGGTGATGAGGAGGAGTATGGAATTGAGTTAGCACTGTTAGGATTGGGCCAGGGTCTGCCAG 

SEQ ID NO:445

>gi|21314683|ref|NM_018070.2| Homo sapiens single stranded DNA binding
protein 3 (SSBP3), mRNA

CCGCGGCCGGGGCTGACGCTTTGACAGCTGGAAAGAGCGCGGAGCCAGCGCCTGGGGGGGAGGGAGGGGA 
GCGCGGCGAGGAGAGCGCCAGCGAGCGAGAGAGCGAGCGAGCGCCGGGGAGGGGGCCGGGAGCGAGGGGC 
AGCTCGGGAGAGCCGGAGCGGTAGCGGCGGCGGCGGCGGCGGCGGCGAGGCTCGGCGCCCTCTTCCCTGC 

AAACCATGTTTGCCAAAGGCAAAGGCTCGGCGGTGCCCTCGGATGGGCAGGCTCGGGAAAAGTTAGCTTT
ATACGTCTACGAATATTTACTGCACGTAGGAGCACAGAAATCTGCACAGACCTTCTTATCGGAGATTCGA 
TGGGAAAAAAACATCACGTTGGGAGAACCGCCTGGGTTTTTGCACTCGTGGTGGTGTGTATTTTGGGACC 
TTTACTGTGC^GCTCCTGAAAGGAGAGACACTTGTGAACATTCAAGTGAAGCAAAAGCCTTTCATGATTA 
TAGTGCAGCAGCTGCCCCGAGCCCCGTGCTTGGCAACATTCCCCCCAACGATGGGATGCCGGGAGGCCCC 

ATCCCGCCAGGTTTCTTTCAGGGTCCTCCGGGGTCACAGCCCTCGCCGCACGCACAGCCTCCACCTCACA
ATCCTAGCAGCATGATGGGACCCCACAGTCAGCCTCCGGGAGGAGTTCCTGGGACACAGCCATTGCTGCC 
CAATTCTATGGATCCCACACGACAACAAGGCCACCCCAACATGGGAGGATCAATGCAGAGAATGAACCCT 
CCCCGAGGCATGGGGCCCATGGGTCCCGGCCCACAGAATTACGGCAGCGGCATGAGACCACCACCCAACT 
CCCTCGGCCCCGCCATGCCCGGGATTAACATGGGCCCGGGAGCTGGCAGACCCTGGCCCAATCCTAACAG 

TGCTAACTCAATTCCATACTCCTCCTCATCACCTGGTACCTATGTGGGACCCCCTGGTGGTGGCGGTCCT
CCAGGAACACCCATTATGCCCAGTCCCGCAGATTCAACAAATTCCAGTGACAACATCTACACAATGATTA 
ATCCAGTGCCGCCTGGAGGCAGCCGGTCCAACTTCCCGATGGGTCCCGGCTCGGACGGTCCGATGGGCGG
CATGGGTGGC^TGGAGCCACACCACATGAATGGATGA^ 

AATTCTCCTAACAACATAAGTGGCATTAGCAATCCTCCAGGCACCCCTCGAGATGACGGCGAGCTAGGAG 
GGAACTTCCTCCACTCCTTTCAGAACGACAATTATTCTCCAAGCATGACGATGAGTGTGTGATCCCCCCT
TCTCCGAGACGCTGAGAGAGCAGGCATTGCAGGCGGGAAGATGCCAGAAATTATGCAAGAAGTGAGGTGT 
CATTATCCAGGAGCTGGTGGGGAGGGCATCTCCCTGCTCCCCTCAACCCCCTCCCACCCCATCCACGCCC 
CCTACCTTTCCCAATTTTAGTTTCATGCAATAAAAAGGCCAAACTTTTTATTCCA 
AAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO:446

>gi|21314684|ref|NP_060540.2| single stranded DNA binding protein 3;
sequence-specific single-stranded-DNA-binding protein [Homo sapiens]
MFAKGKGSAVPSDGQAREKLALYVYEYLLHVGAQKSA 

CAAPERRDTCEHSSEAKAFHDYSAAAAPSPVLGNIPPNDGMPGGPIPPGFFQGPPGSQPSPHAQPPPHNP

SSMMGPHSQPPGGVPGTQPLLPNSMDPTRQQGHPNMGGSMQRMNPPRGMGPMGPGPQNYGSGMRPPPNSL

GPAMPGINMGPGAGRPWPNPNSANSIPYSSSSPGTYVGPPGGGGPPGTPIMPSPADSTNSSDNIYTMINP

VPPGGSRSNFPMGPGSDGPMGGMGGMEPHHMNGSLGSGDIDGLPKNSPNNISGISNPPGTPRDDGELGGN
FLHSFQNDNYSPSMTMSV 

SEQ ID NO:447 

SUMO-specific protease-1 (SUSP-1)

>GL3_1.41_PCR_G3F1 Direction: sense

GGAGTCGTGGGCCGAGAGGAACCGGGCCCGGGAAGCGCCGTCGTCGTCGTCGCCGGTCGCGTTCCCCCGGAGAGG 
CCTGAGAAGCTCCGGGCCGCGGGCCTCGCTGCCCGCCAGCCCGCGGACAGGCCCGGGCGCGCCTGGCCTGCCTTT 
GTATAGGCCCGTCTGAACGTGGGAGCGCAGCCCGCCTGACGGCTGAGCCCGAGGCCCGCAACCCTGCGGCGTCTA 
CCCTCCTCCGGCGCGGCCCCTCATCCCGGCGAGCACGGCGGCGGTGTGGGCCATGGATTAAGAAGGAGGCGGCGT 
GGGAGGAGGAAGATGGCGGCCGGCAAGAGCGGCGGTAGCGCAGGGGAGATTACTTTTCTGGAAGCTTTGGCTAGA 
TCAGAGTCTAAGAGAGATGGAGCCGCAGAACCAGCACAGTGGTTAGATGATAAGCGGCCGCTCGACTAGTTCT 

SEQ ID NO:448 

>GL3_1.41_A_M13F Direction: sense

GGGCGGTGGATGGGGAGTCGTGGGCCGAGAGGAACCGGGCCCGGGAAGCGCCGTCGTCGTCGTCGCCGGTCGCGT 

TGCCCCGGAGAGGCCTGAGAAGCTCGGGCCGCGGGCCTCGCTGCCCGCCAGCCCGCGGACAGGCCCGGGCGCGCC

TGGCCTGCCTTTGTATAGGCCCGTCTGAACGTGGGAGCGCAGCCCGCCTGACGGCTGAGCCCGAGGCCCGCAACC 

CTGCGGCGTCTACCCTCCTCCGGCGCGGCCCCTCATCCCGGCGAGCACGGCGGCGGTGTGGGCCATGGATTAAGA 

AGGAGGCGGCGTGGGAGGAGGAAGATGGCGGCCGGCAAGAGCGGCGGTAGCGCAGGGGAGATTACTTTTCTGGAA 

GCTTTGGCTAGATCAGAGTCTAAGAGAGATGGAGCCGCANAACCAGCACAGTGGTTAGATAGATAAAGCGGCCGC 

TCGACTAGTCTGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATTCCAGCACACTGGCGGCCGTTACTAGTG 

GATCCGAGCTCGGTACCAAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTTGTGAAATTGTTATCCGCTCAC 
AATTCCACACAACA 

SEQ ID NO:449 

>GL3_1.41_A_M13R Direction: anti-sense 

ATCACTAGTAACGGCCGCCAGTGTGCTGGAATTCGCCCTTACGACAGTCAGTGAGTATCAGACCTCAGACTAGTC 
GAGCGGCCGCTTTATCTATCTAACCACTGTGCTGGTTCTGCGGCTCCATCTCTCTTAGACTCTGATCTAGCCAAA 
GCTTCCAGAAAAGTAATCTCCCCTGCGCTACCGCCGCTCTTGCCGGCCGCCATCTTCCTCCTCCCACGCCGCCTC 
CTTCTTAATCCATGGCCCACACCGCCGCCGTGCTCGCCGGGATGAGGGGCCGCGCCGGAGGAGGGTAGACGCCGC 
AGGGTTGCGGGCCTCGGGCTCAGCCGTCAGGCGGGCTGCGCTCCCACGTTCAGACGGGCCTATACAAAGGCAGGC 
CAGGCGCGCCCGGGCCTGTCCGCGGGCTGGCGGGCAGCGAGGCCCGCGGCCCGAGCTTCTCAGGCCTCTCCGGGG 
GAACGCGACCGGCGACGACGACGACGGCGCTTCCCGGGCCCGGTTCCTCTCGGCCCACGACTCCCCATCCACCGC 



SEQ ID NO:450

>GL3_1.41_B_M13F Direction: sense

GGGCGGTGGATGGGGAGTCGTGGGCCGAGAGGAACCGGGCCCGGGAAGCGCCGTCGTCGTCGTCGCCGGTCGCGT 

TCCCCCGGAGAGGCCTGAGAAGCTCGGGCCGCGGGCCTCGCTGCCCGCCAGCCCGCGGACAGGCCCGGGCGCGCC 

TGGCCTGCCTTTGTATAGGCCCGTCTGAACGTGGGAGCGCAGCCCGCCTGACGGCTGAGCCCGAGGCCCGCAACC 

CTGCGGCGTCTACCCTCCTCCGGCGCGGCCCCTCATCCCGGCGAGCACGGCGGCGGTGTGGGCCATGGATTAAGA 

AGGAGGCGGCGTGGGAGGAGGAAGATGGCGGCCGGCAAGAGCGGCGGTAGCGCAGGGGAGATTACTTTTCTGGAA 

GCTTTGGCTAGATCAGAGTCTAAGAGAGATGGAGCCGCAGAACCAGCACAGTGGTTAGATAGATAAAGCGGCCGC 

TCGACTAGTCTGAGGTCTGATACTCACTGACTGTCGAAGGGCGAATTCCAGCACACTGGCGGCCGTTACTAGTGG 

ATCCGAGCTCGGTACCAAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTNATCCGCTCACAA 
TTCCACACAACATACGAGCCGGAAGCAT 

SEQ ID NO:451

>GL3_1.41_B_M13R Direction: anti-sense

CGGATCACTAGTAACGGCCGCCAGTGTGCTGGAATTCGCCCTTCGACAGTCAGTGAGTATCAGACCTCAGACTAG 
TCGAGCGGCCGCTTTATCTATCTAACCACTGTGCTGGTTCTGCGGCTCCATCTCTCTTAGACTCTGATCTAGCCA 
AAGCTTCCAGAAAAGTAATCTCCCCTGCGCTACCGCCGCTCTTGCCGGCCGCCATCTTCCTCCTCCCACGCCGCC
TCCTTCTTAATCCATGGCCCACACCGCCGCCGTGCTCGCCGGGATGAGGGGCCGCGCCGGAGGAGGGTAGACGCC
GCAGGGTTGCGGGCCTCGGGCTCAGCCGTCAGGCGGGCTGCGCTCCCACGTTCAGACGGGCCTATACAAAGGCAG 
GCCAGGCGCGCCCGGGCCTGTCCGCGGGCTGGCGGGCAGCGAGGCCCGCGGCCCGAGCTTCTCAGGCCTCTCCGG 
GGGAACGCGACCGGCGACGACGACGACGGCGCTTCCCGGGCCCGGTTCCTCTCGGCCCACGACTCCCCATCCACC 
GCCC

SEQ ID NO:452 

>GL3_1.41_C_M13F Direction: sense 

GGGCGGTGGATGGGGAGTCGTGGGCCGAGAGGAACCGGGCCCGGGAAGCGCCGTCGTCGTCGTCGCCGGTCGCGT 
TCCCCCGGAGAGGCCTGAGAAGCTCGGGCCGCGGGCCTCGCTGCCCGCCAGCCCGCGGACAGGCCCGGGCGCGCC
TGGCCTGCCTTTGTATAGGCCCGTCTGAACGTGGGAGCGCAGCCCGCCTGACGGCTGAGCCCGAGGCCCGCAACC 
CTGCGGCGTCTACCCTCCTCCGGCGCGGCCCCTCATCCCGGCGAGCACGGCGGCGGTGTGGGCCATGGATTAAGA 
AGGAGGCGGCGTGGGAGGAGGAAGATGGCGGCCGGCAAGAGCGGCGGTAGCGCAGGGGAGATTACTTTTCTGGAA 
GCTTTGGCTAGATCAGAGTCTAAGAGAGATGGAGCCGCAGAACCAGCACAGTGGTTAGATAGATAAAGCGGCCGC 
TCGACTAGTCTGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATTCCAGCACACTGGCGGCCGTTACTAGTG
GATCCGAGCTCGGTACCAAGCTTGGCGTAATCATGGTCATAGCTGTTTCC 

SEQ ID NO:453

>GL3_1.41_C_M13R Direction: anti-sense

GCTCGGATCACTAGTAACGGCCGCCAGTGTGCTGGAATTCGCCCTTACGACAGTCAGTGAGTATCAGACCTCAGA
CTAGTCGAGCGGCCGCTTTATCTATCTAACCACTGTGCTGGTTCTGCGGCTCCATCTCTCTTAGACTCTGATCTA 
GCCAAAGCTTCCAGAAAAGTAATCTCCCCTGCGCTACCGCCGCTCTTGCCGGCCGCCATCTTCCTCCTCCCACGC 
CGCCTCCTTCTTAATCCATGGCCCACACCGCCGCCGTGCTCGCCGGGATGAGGGGCCGCGCCGGAGGAGGGTAGA 
CGCCGCAGGGTTGCGGGCCTCGGGCTCAGCCGTCAGGCGGGCTGCGCTCCCACGTTCAGACGGGCCTATACAAAG 

GCAGGCCAGGCGCGCCCGGGCCTGTCCGCGGGCTGGCGGGCAGCGAGGCCCGCGGCCCGAGCTTCTCAGGCCTCT
CCGGGGGAACGCGACCGGCGACGACGACGACGGCGCTTCCCGGGCCCGGTTCCTCTCGGCCCACGACTCCCCATC 
CACCGCCC 

SEQ ID NO:454

>GL3_1.41_D_M13F Direction: sense

GGGCGGTGGATGGGGAGTCGTGGGCCGAGAGGAACCGGGCCCGGGAAGCGCCGTCGTCGTCGTCGCCGGTCGCGT 
TCCCCCGGAGAGGCCTGAGAAGCTCGGGCCGCGGGCCTCGCTGCCCGCCAGCCCGCGGACAGGCCCGGGCGCGCC 
TGGCCTGCCTTTGTATAGGCCCGTCTGAACGTGGGAGCGCAGCCCGCCTGACGGCTGAGCCCGAGGCCCGCAACC 
CTGCGGCGTCTACCCTCCTCCGGCGCGGCCCCTCATCCCGGCGAGCACGGCGGCGGTGTGGGCCATGGATTAAGA 

AGGAGGCGGCGTGGGAGGAGGAAGATGGCGGCCGGCAAGAGCGGCGGTAGCGCAGGGGAGATTACTTTTCTGGAA
GCTTTGGCTAGATCAGAGTCTAAGAGAGATGGAGCCGCAGAACCAGCACAGTGGTTAGATAGATAAAGCGGCCGC 
TCGACTAGTCTGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATTCCAGCACACTGGCGGCCGTTACTAGTG 
GATCCGAGCTCGGTACCAAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAT^TTGTTATCCG 

SEQ ID NO:455

>GL3_1.41_D_M13R Direction: anti-sense

GATCACTAGTAACGGCCGCCAGTGTGCTGGAATTCGCCCTTACGACAGTCAGTGAGTATCAGACCTCAGACTAGT 
CGAGCGGCCGCTTTATCTATCTAACCACTGTGCTGGTTCTGCGGCTCCATCTCTCTTAGACTCTGATCTAGCCAA 
AGCTTCCAGAAAAGTAATCTCCCCTGCGCTACCGCCGCTCTTGCCGGCCGCCATCTTCCTCCTCCCACGCCGCCT 
CCTTCTTAATCCATGGCCCACACCGCCGCCGTGCTCGCCGGGATGAGGGGCCGCGCCGGAGGAGGGTAGACGCCG
CAGGGTTGCGGGCCTCGGGCTCAGCCGTCAGGCGGGCTGCGCTCCCACGTTCAGACGGGCCTATACAAAGGCAGG 
CCAGGCGCGCCCGGGCCTGTCCGCGGGCTGGCGGGCAGCGAGGCCCGCGGCCCGAGCTTCTCAGGCCTCTCCGGG 
GGAACGCGACCGGCGACGACGACGACGGCGCTTCCCGGGCCCGGTTCCTCTCGGCCCACGACTCCCCATCCACCG 
CCC 

SEQ ID NO:456

>gi|7662311|ref|NM_015571.1| Homo sapiens SUMO-1-specific protease (SUSP1),
mRNA

ATGGCGGCCGGCAAGAGCGGCGGTAGCGCAGGGGAGATTACTTTTCTGGAAGCTTTGGCTAGATCAGAGT 
CTAAGAGAGATGGAGGTTTTAAAAATAATTGGAGCTTTGATCATGAAGAAGAAAGTGAAGGAGATACAGA

TAAAGATGGGACAAATCTGCTCAGTGTGGATGAAGATGAGGATTCTGAAACCTCAAAAGGAAAAAAGTTA 

AATCGTCGATCTGAAATTGTTGCTAATAGCTCTGGTGAATTCATCTTGAAGACATATGTAAGACGAAACA 

AGTCTGAAAGTTTTAAAAOTTTGAAAGGCAACCCAATTGGACTTAACATG 

GAGTGAAAATATGCAAAATACGTCATTATGTTCTGGAACTGT^ 
CATGGAC^GATACGAGTAGTAAAAACAGCAGCCCAAAGCAGTCTGGACCGAAAAGAAAGGAAAGAATACC
CACCTCATGTCCAAAAAGTTGAAATTAATCCTGTAAGGTTAAGTCGGCTCCAAGGTGTTGAACGTATAAT

GAAGAAAACAGAAGAGTCCGAATCACAAGTGGAGCCTGAAATTAAGAGGAAAGTACAACAGAAACGGCAC 

TGTAGTACCTATCAGCCTACTCCTCCTCTATCTCCTGCTTCAAAAAAATGTTTAACCCATTTAGAGGATT 

TGCAAAGAAATTG(^GAC^GCTATTACTTTGAATGAGTCTACTGGACCATTATTAAGAACGTCAATTCA 

TCAGAATTCTGGAGGACAGAAGTCACAAAACACAGGATTAACAACCAAGAAGTTTTATGGCAACAATGTG 

GAAAAGGTTCCAATTGATATTATTGTGAATTGTGATGACAGTAAACACACTTATTTACAGACTAATGGAA 

AAGTCATTTTACCTGGGGCAAAAATACCCAAAATCACAAACTTGAAAGAAAGGAAAACAAGTTTGTCAGA 

CCTAAATGATCCAATCATTTTGTCCAGTGATGATGATGATGACAACGACAGAACTAACAGAAGAGAAAGC 

ATATCTCCTCAGCCTGCTGATTCAGCATGTTCTTCCCCTGCACCATCCACTGGAAAAGTAGAAGCAGCAC 

TAAATGAAAATACTTGCAGAGCAGAGCGTGAACTACGAAGCATTCCAGAAGACTCAGAGTTAAATACAGT 

TACATTGCCAAGAAAAGCAAGAATGAAAGACCAGTTTGGCAATTCTATTATCAACACACCTCTGAAACGT 

CGTAAAGTGTTTTCTCAAGAACCTCCAGATGCTTTAGCTTTAAGCTGCCAAAGTTCCTTTGACAGTGTCA 

TTTTAAACTGTCGAAGTATACGAGTAGGAACACTCTTCCGGCTGTTAATAGAGCCTGTAATTTTTTGTTT 

AGATTTTATCAAGATACAGCTAGACGAACCAGACCATGATCCTGTAGAGATTATATTAAATACCTCTGAT 

CTAACTAAATGTGAATGGTGTAATGTCCGAAAATTACCTGTAGTGTTTCTTCAAGCAATTCCAGCAGTTT 

ATCAAAAGCTGAGCATCCAACTGCAAATGAATAAGGAGGATAAAGTTTGGAATGATTGTAAAGGAGTAAA 

TAAATTAACAAATTTAGAAGAACAATATATAATTTTAATTTTTCAAAATGGCCTTGATCCTCCGGCAAAT 

ATGGTATTTGAAAGTATCATTAATGAAATTGGTATAAAGAATAACATCTCCAATTTTTTTGCGAAAATTC 

CCTTTGAAGAAGCTAATGGCAGACTTGTTGCCTGTACAAGAACCTATGAAGAGAGCATCAAAGGAAGTTG 

TGGGCAAAAGGAAAACAAAATTAAAACTGTATCATTTGAATCTAAAATACAACTTAGAAGCAAACAAGAA

TTTCAGTTTTTTGATGAAGAAGAAGAAACTGGAGAAAACCACACCATCTTCATTGGCCCAGTAGAAAAGT 

TGATAGTATATCCACCACCTCCAGCTAAGGGAGGCATCTCTGTTACCAATGAGGACCTGCACTGTCTAAA 

TGAAGGAGAATTTTTAAATGATGTTATTATAGACTTTTATTTGAAATACTTGGTGCTTGAAAAACTGAAG 

AAGGAAGACGCTGACCGAATTCATATATTCAGTTCTTTTTTCTATAAACGCCTTAATCAGAGAGAGAGGA 

GAAATCATGAAACAACTAATCTGTCAATACAGCAAAAACGGCATGGGAGAGTAAAAACATGGACCCGGCA 

CGTAGATATTTTTGAGAAGGATTTTATTTTTGTACCCCTTAATGAAGCTGCACACTGGTTTTTGGCTGTT 

GTTTGTTTCCCCGGTTTGGAAAAACCAAAGTATGAACCTAATCCTCATTACCATGAAAATGCTGTCATAC 

AGAAATGTTCAACTGTAGAGGACAGTTGTATTTCTTCTTCAGCCAGTGAAATGGAGAGTTGTTCACAAAA 

CTCTTCTGCCAAGCCTGTAATTAAGAAGATGCTAAACAAAAAACATTGCATAGCTGTAATTGATTCCAAT 

CCTGGGCAGGAAGAAAGTGACCCTCGTTATAAGAGAAACATATGCAGTGTAAAATACAGTGTGAAAAAAA 

TAAATCATACTGCGAGTGAAAATGAAGAATTCAATAAAGGAGAATCTACATCCCAGAAAGTTGCTGATAG 

GACTAAAAGTGAGAATGGCCTACAGAATGAAAGTTTAAGTTCCACACATCATACAGATGGCTTAAGCAAA 

ATCAGACTAAACTATAGCGATGAATCACCTGAAGCTGGTAAAATGCTTGAAGATGAACTCGTCGACTTCT 

CAGAAGATCAGGATAACCAGGATGATAGCAGTGACGATGGATTCCTCGCTGATGACAACTGCAGTTCAGA 

AATAGGACAGTGGCATTTAAAGCCTACTATCTGTAAACAACCTTGTATCCTACTTATGGACTCACTCCGA 

GGCCCTTCTCGGTCAAATGTTGTCAAAATTTTAAGAGAGTATTTAGAAGTGGAATGGGAAGTTAAAAAAG 

GAAGCAAAAGAAGTTTTTCCAAAGATGTTATGAAGGGCTCTAATCCAAAAGTACCACAGCAAAACAACTT 

CAGTGACTGTGGTGTATATGTATTGCAGTATGTAGAGAGCTTTTTTGAGAATCCAATTCTCAGTTTTGAA 

CTACCTATGAATTTGGCAAACTGGTTTCCTCCACCAAGAATGAGAACAAAAAGAGAAGAAATCCGAAACA 

TAATTCTGAAGCTACAGGAAGATCAGAGCAAAGAGAAAAGAAAGCATAAGGACACTTACTCAACAGAAGC 

ACCTTTAGGCGAAGGAACAGAACAATGTGTCAATAGTATCTCAGATTGACCATTTCTGTTACTTGTCATT 

TCTACTTTCAGAAACTAAATGACTTTCAAATTTGGGTATAGACAATAAAGAACTGAAGTGCTCACTACTC

AGTGATTTGGAAATTTTGATGCTTGTATAAATGTCAGATAATTAATTTCCAAAGGCGTATGTATTAAGTA 

AAAGTCTGTAAATATGTTAATGAGGCCAATTTTTCCAGCATTTATAATTATTTTTTTCACTTGTTAGGAA 

GCTTTTGTTATGTATTTTCTGTTAATAGTACCTAAAATTGCAACTTCTAAACCCAAATAAAAAGAAAATA 

TTTATAGGAGGAAATGATTAATTTGATATTCTTTAGTGAACTTGTTTAATTCCTCAGTGGGTGTGACATA

TTTCATGGGAATATTCAAATATCTATGGTAATATTTTGACCCTTTATATTTGTTCTAAAATAAGTCAAAA 

TGTGAAAATAATATTAAATCTAAGATATTTTGAACTAAGCATCTTTATATGCTTGTGTAACAGGAACAAA 

GTAACAGCCTTTC^TTC^TATACTGCCTTGTGTTCAGTGAACCCAAGAAATGTAATAAATATTTGTAAT 

TTTACACAAATATTTAAGAGGAAAGAGTATTAAGAGCAATTCAAAAAAAGTAACCTTATACTACTAAAAA 

AAAAATTCTTGCATATATTATCATCAAATGCATTTTTGAAGACATCAAAGACTCAGGTTAAAR.CTATTTT 

GGTAAGTGCAGCTTGAATTTCAAATATCCCGTGTTACCTTTCTCTATTACAGCTTAAAGTATGCTACAAT 

CTGTGTCATATAGTTAATTGATAAGCATTTTTAATCTGTGTAAACACAGGAATTTAAATAGGAATTTACT 
ATTTTTTTAT 

SEQ ID NO:457 




NRRSEIVANSSGEFILKTYVRRNKSESFKTLKGNPIG^
EKVPIDIIVNCDDSKHTYLQTNGKVILPGAKIPKITNLKERKTSLSDLNDPIILSSDDDDDNDRTNRRES

ISPQPADSACSSPAPSTGKVEAALNENTCRAERELRSIPEDSELNTVTLPRKARMKDQFGNSIINTPLKR
RKVFSQEPPDALALSCQSSFDSVILNCRSIRVGTLFRLLIEPVIFCLDFIKIQLDEPDHDPVEIILNTSD

LTKCEWCNVRKLPVVFLQAIPAVYQKLSIQLQMNKEDKVWNDCKGVNKLTNLEEQYIILIFQNGLDPPAN
MVFESIINEIGIKNNISNFFAKIPFEEANGRLVACTRTYEESIKGSCGQKENKIKTVSFESKIQLRSKQE
FQFFDEEEETGENHTIFIGPVEKLIVYPPPPAKGGISVTNEDLHCLNEGEFLNDVIIDFYLKYLVLEKLK
KEDADRIHIFSSFFYKRLNQRERRNHETTNLSIQQKRHGRVKTWTRHVDIFEKDFI

VCFPGLEKPKYEPNPHYHENAVIQKCSTVEDSCISSSASEMESCSQNSSAKPVIKKMLNKKHCIAVIDSN
PGQEESDPRYKRNICSVKYSVKKINHTASENEEFNKGESTSQKVADRTKSENGLQNESLSSTHHTDGLSK
IRLNYSDESPEAGKMLEDELVDFSEDQDNQDDSSDDGFLADDNCSSEIGQWHLKPTICKQPCILLMDSLR
GPSRSNVVKILREYLEVEWEVKKGSKRSFSKDVMKGSNPKVPQQNNFSDCGVYVLQYVESFFENPILSFE
LPMNLANWFPPPRMRTKREEIRNIILKLQEDQSKEKRKHKDTYSTEAPLGEGTEQCVNSISD

SEQ ID NO:458 

Synaptophysin homologue (pantophysin)

>GL2_71_G03_G3F1 Direction: sense

ACAGGAACACAAAGACTGCAAAGGTAACATAGAATTGTGCAGAAGAAGAGTAATCGCCTATGAGGACGTAATCTT 

TCCAATTTACATCACATATGTTTACACCTGGAGGTGGCTGAAATGATGCCTCATTCAACCTGAATGGATAACCAA 

AAGTAGCTGTAACAGTTTTATTCTCAGTAACTGCAGGAGGACAATTCACTTGAATTTCTGTTTGGCCCTTAAAAC 

CTCCACAGGTGGCAAAAGCAAAGATAGAAGCAATCCACTCGAGGACCTTGATGAAGCCGAGTGGCTCCTTGAGCG 

GGTTGAGGTTGATCTGGAAGCCGCCGCAGGACCCAGCGCAGTGGTTAGATAGATAAAGCGACCGCTCGACTAGTC 

TGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATTCGTTTAAACCTGCAGGACTAGTCCCTTTAGTGAGGGT 

TAATTCTGAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACA 

ACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTTGCGTTG 

CGCTCACTGGCCGCTTTCCAGTCGGGAAAACCTTGTCGTTGCCAGTGCATTAATGAATCGGCCAACGCCGCGGGG 

NAGAGGNCGGGGTAGCGTNAATAGGGGGCGGCCCCTTACCCGGTTTCCTCCCGCTCANTTGAATCCGCTTGGCCC 

CGGGCG 

SEQ ID NO:459

>GL2_71_G04_G3F1 Direction: sense
ACAGGAACACAAAGACTGCAAAGGTAACATAGAATTGTGCAGAAGAAGAGTAATCGCCTATGAGGACGTAATCTT 

TCCAATTTACATCACATATGTTTACACCTGGAGGTGGCTGAAATGATGCCTCATTCAACCTGAATGGATAACCAA 

AAGTAGCTGTAACAGTTTTATTCTCAGTAACTGCAGGAGGACAATTCACTTGAATTTCTGTTTGGCCCTTAAAAC

CTCCACAGGTGGCAAAAGCAAAGATAGAAGCAATCCACTCGAGGACCTTGATGAAGCCGAGTGGCTCCTTGAGCG 

GGTTGAGGTTGATCTGGAAGCCGCCGCAGGACCCAGCGCAGTGGTTAGATAGATAAAGCGACCGCTCGACTAGTC 

TGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATTCGTTTAAACCTGCAGGACTAGTCCCTTTANTGAGGGT 

TAATTCTGAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACA 

ACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTTAATGAGTTGAGCTAACTCACATTTAATTTGCG 

TTTGCGCTCACTTGGGCCGCTTTCCAGGTTCGGGGAAAAACTGTCGTGCCCAGTTGCTTAATGAATCGGGCAACG 

CGCGGGGGAGAGGCGGTGGGTATNG 

SEQ ID NO:460

>GL2_71_G05_G3F1 Direction: sense

ACAGGAACACAAAGACTGCAAAGGTAACATAGAATTGTGCAGAAGAAGAGTAATCGCCTATGAGGACGTAATCTT 
TCCAATTTACATCACATATGTTTACACCTGGAGGTGGCTGAAATGATGCCTCATTCAACCTGAATGGATAACCAA 
AAGTAGCTGTAACAGTTTTATTCTCAGTAACTGCAGGAGGACAATTCACTTGAATTTCTGTTTGGCCCTTAAAAC 
CTCCACAGGTGGCAAAAGCAAAGATAGAAGCAATCCACTCGAGGACCTTGATGAAGCCGAGTGGCTCCTTGAGCG 
GGTTGAGGTTGATCTGGAAGCCGCCGCAGGACCCAGCACAGTGGTTAGATAGATAAAGCGGCCGCTCGACTAGTC 
TGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATTCGCGGCCGCTAAATTCAATTCGCCCTATAGTG 

SEQ ID NO:461

>GL2_71_G06_G3F1 Direction: sense

ACAGGAACACAAAGACTGCAAAGGTAACATAGAATTGTGCAGAAGAAGAGTAATCGCCTATGAGGACGTAATCTT 
TCCAATTTACATCACATATGTTTACACCTGGAGGTGGCTGAAATGATGCCTCATTCAACCTGAATGGATAACCAA 
AAGTAGCTGTAACAGTTTTATTCTCAGTAACTGCAGGAGGACAATTCACTTGAATTTCTGTTTGGCCCTTAAAAC 
CTCCACAGGTGGCAAAAGCAAAGATAGAAGCAATCCACTCGAGGACCTTGATGAAGCCGAGTGGCTCCTTGAGCG 
GGTTGAGGTTGATCTGGAAGCCGCCGCAGGACCCAGCACAGTGGTTAGATAGATAAAGCGGCCGCTCGACTAGTC 
TGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATTCGCGGCCGCTAAATTCAATTCGCCCTATAGTGAGTCG
TATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTT 
GC^GCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGACCGATCGCCCTTCCCAACAGTTGCGCA
GCCTATACGTACGGCAGTTTAAGGTTTTACNACCTATTAAAAGAGAGAGCCCGGTATCGTCTGTTGTGGATGTNC
AGAGTGATATTTATTTGGAACCCCGGGGAAAGAAT

SEQ ID NO:462

>gi|5803184|ref|NM_006754.1| Homo sapiens synaptophysin-like protein
(SYPL), mRNA

TGAACCGAGGCAAGGGGGCGCGGCGCACGCAGTATGGCGCCCAACATCTACTTGGTTCGCCAGCGGATCA 
GTCGACTCGGCCAGAGGATGTCCGGCTTCCAGATCAACCTCAACCCGCTCAAGGAGCCACTCGGCTTCAT 
CAAGGTCCTCGAGTGGATTGCTTCTATCTTTGCTTTTGCCACCTGTGGAGGTTTTAAGGGCCAAACAGAA 

ATTCAAGTGAATTGTCCTCCTGCAGTTACTGAGAATAAAACTGTTACAGCTACTTTTGGTTATCCATTCA
GGTTGAATGAGGCATCATTTCAGCCACCTCCAGGTGTAAACATATGTGATGTAAATTGGAAAGATTACGT 
CCTCATAGGCGATTACTCTTCTTCTGCACAATTCTATGTTACCTTTGCAGTCTTTGTGTTCCTGTACTGC 
ATTGCTGCCCTTCTGCTTTATGTTGGCTACACGAGTCTGTATCTGGATAGTCGTAAACTTCCTATGATAG 
ACTTTGTTGTTACACTTGTTGCCACTTTTTTGTGGTTGGTGAGCACTTCAGCCTGGGCTAAAGCTCTGAC 

AGATATTAAAATAGCTACTGGTCACAATATTATTGATGAACTTCCGCCTTGTAAGAAGAAAGCAGTACTG
TGTTACTTTGGCTCTGTGACCAGTATGGGATCCCTAAATGTATCTGTGATATTTGGCTTTCTAAATATGA 
TACTCTGGGGAGGAAATGCTTGGTTTGTGTACAAGGAGACCAGCCTACACAGTCCATCAAATACATCTGC 
CCCTCATAGCCAAGGAGGTATTCCACCTCCTACCGGAATATAATTAAAGGGAGAAATACACTGTATGAAG 
TATATGTTGATACTATGACATGTTGCCAACACCTTGAGAAGCATTATTTGTTTCTAATAAAAGTAATGGC 



TTAAGATGTAAATGTATTCTCACATGTAAATTTGAAAGTTCAGGGGTCTATTATGAAATGGATTACACAT 

TGTATCGCTCATTCAGAAAACTGTTATATTGTCATACCATAGTAGGAAGAAAAACCTTTATTTGGAATAT 
ACACTACTGTAAGTTTGTACAGATCATATACCTACCACCTGTCTTTGCTTAAAGAGCCTTGATTACATAA 

ATATGTAGGAAAAAACATATTGAGTTCAAAATTTATATCTAACATTGTTTATGTTATGATTTTTTTTAAT
TGCAAAGACTAGGTGTATATTTTTTTCTGTTTTTCTAAATGACCCGTGGTACTTAATAGGTGTACTAAAA 
TTGTGTTGGGAGCAGGGATTTGGAAATTTCTGAGAGATGTGTAGTTAATTAGTAATTCTGTTTCATGAGA 
TATGATCTGTTATGCTAGTGGTTTAATAGGCTTGCTATGTAAGTAGAACGTGGCTCAACTAGATATCTTA 
TATGTATGGGCATTACCTCTTAGTGATATTTGTTTCCTGTCCTTTGTTGCTCATGCTGTTTAAGTGCAGG 

CTGAGACCCAGCCTCTTTGTAAGTACAGTAAAATAATCCACCGTTTTTTACAGACCCTAGTCAAAGGGTT
AAAAAAATTAAGATTGCTTTCCATGTTTGAAATTTACCATTGAGAGTCAATGAAGTTGCTATTTTGAGTT 
TAGCATTGATATTGTGAAAATAAGTGCAATTTGGATTTCATGTTTCTTAATATTCATTCTTGTTTCACAA 
ATGAATGATTAAGGAATTATGCATCATAAAGGAACCTAAGTGAGGTATATGATGAGTGTATTGTCTTTGC 
ACACACATATAGGTATATTCTGAATACAAGCTTATTCATTTTGCTTCCTAATCTTTTTGTTGTACAGGGA 

TTCAGGTTTCTTATTCTTACAACATGATTGTTTATATGTGAAGCACATCTTGCTGTTGCCTTATTTTTGA
TGCTTTTATTCATGACAAGAATTGTCAATATAAGAATGTATATCTTTGCCGCAACCAATTTAATAAAGGA 
GTTGAAAGAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO:463

>gi|5803185|ref|NP_006745.1| synaptophysin-like protein; pantophysin [Homo
sapiens]

MAPNIYLVRQRISRLGQRMSGFQINLNPLKEPLGFIKVLEWIASIFAFATCGGFKGQTEIQVNCPPAVTE
NKTVTATFGYPFRLNEASFQPPPGVNICDVNWKDYVLIGDYSSSAQFYVTFAVFVFLYCIAALLLYVGYT
SLYLDSRKLPMIDFVVTLVATFLWLVSTSAWAK^
LNVSVIFGFLNMILWGGNAWFVYKETSLHSPSNTSAPHSQGGIPPPTGI

SEQ ID NO:464
Thrombospondin-1

>GL2_199_2_M13R Direction: anti-sense
CCAATGTAGTTAGTGCGGATGGCAGGGCTGGAACCATTCACCACGTTGTTGTCAAGGGTGAGGAGGACACTGGTA
GAGCTGGAGCAGCCTTTGTTCCTGAGGATGTCTTCTGGTGTGGTTCCAAAGACAAACCTCACATTCTGCAGCACC 
CCCTGGAAAATGTCA 

SEQ ID NO:465
>GL2_199_1_M13R Direction: sense

TGACATTTTCCAGGGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACA 
AAGGCTGCTCCAGCTCTACCAGTGTNCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGGCATCC 
GNACTAACTACATTGG 



SEQ ID NO:466

>GL2_199_3_M13F Direction: sense

TGACaVtTTCCAGGGGGTGCTGCAGAATGTGAGGTTTC^^ 

AGGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCG 
CACTAACTACATTGG 

SEQ ID NO:467

>GL2_66_F01_G3F1 Direction: sense

TGACATTTTCCAGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAA 
GGCTGCTCCAGCTCTACCAGTGTCTCCTCACCTTGACAACAACGTGGTGAATGGTTCCAGCCTGGCATCCGACTA 

ACTACATTGG 

SEQ ID NO:468

>GL2_66_F02_G3F1 Direction: sense

TGACATTTTCCAGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAA 
GGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCGC 

ACTAACTACATTGG 



SEQ ID NO:469

>GL2_66_F03_G3F1 Direction: sense

TGACATTTTCCAGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAA 

GGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCG 
CACTAACTACATTGG 

SEQ ID NO:470

>GL2_66_F04_G3F1 Direction: sense

TGACATTTTCCAGGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAA 

AGGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCG 
CACTAACTACATTGG 

SEQ ID NO:471

>GL2_86_2A01_G3F1 Direction: sense

TGACATTTTCCAGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAA 
GGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCGC 

ACTAACTACATTGG 

SEQ ID NO:472

>GL2_86_2A02_G3F1 Direction: sense

TGACATTTTCCAGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAA 
GGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCG 

CACTAACTACATTGG 

SEQ ID NO:473

>GL2_86_2A03_G3F1 Direction: sense

TGACATTTTCC^GGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAA 
GGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCG 

CACTAACTACATTGG 

SEQ ID NO:474

>GL2_86_2A04_G3F1 Direction: N/A

TGACATTTTCCAGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAA 
GGCTGTCCAGCTCTACAGCGTCCTCCTCA 

SEQ ID NO:475

>GL2_86_2A06_G3F1 Direction: sense

TGACATTTCAGGG^TGCTGCAGAATGTGAGGTTTGTCTTTGGAATCAC^CCAGAAGACATCCTCAGGAACAAAGG 
CTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCGCAC 

TAACTACATTGG

SEQ ID NO:476

>GL2_86_2A07_G3F1 Direction: sense 

CGACATTTTCCAGGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAA 
AGGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCG 
CACTAACTACATTGG

SEQ ID NO:477 

>GL2_86_2A08_G3F1 Direction: sense 

TGACATTTTCCAGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAA 
GGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCGC
ACTAACTACATTGG 

SEQ ID NO:478 

>GL2_86_2A09_G3F1 Direction: sense 
GTGACATTATCCAGTGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACNCAGAAGACATCCTCAGGAAC
AAAGGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGACAACAACGATGGTGAATGGTTCCAGCCCTGNCAT 
CCGCACTAACTACATTGG 

SEQ ID NO:479

>GL2_199_3_M13R Direction: anti-sense

CCAATGTAGTTAGTGCGGATGGCAGGGCTGGAACCATTCACCACGTTGTTGTCAAGGGTGAGGAGGACACTGGTA 
GAGCTGGAGCAGCCTTTGTTCCTGAGGATGTCTTCTGGTGTGGTTCCAAAGACAAACCTCACATTCTGCAGCACC 
CCCTGGAAAATGTCA 

SEQ ID NO:480

>GL2_86_2A05_G3F1 Direction: sense

TGACATTTTCCAGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAA 
GGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCGC 
ACTAACTACATTGG 

SEQ ID NO:481

>gi|4507484|ref|NM_003246.1| Homo sapiens thrombospondin 1 (THBS1), mRNA

GGACGCACAGGCATTCCCCGCGCCCCTCCAGCCCTCGCCGCCCTCGCCACCGCTCCCGGCCGCCGCGCTC 

CGGTACACACAGGATCCCTGCTGGGCACCAACAGCTCCACCATGGGGCTGGCCTGGGGACTAGGCGTCCT 

GTTCCTGATGCATGTGTGTGGCACCAACCGCATTCCAGAGTCTGGCGGAGACAACAGCGTGTTTGACATC
TTTGAACTCACCGGGGCCGCCCGCAAGGGGTCTGGGCGCCGACTGGTGAAGGGCCCCGACCCTTCCAGCC 
CAGCTTTCCGCATCGAGGATGCCAACCTGATCCCCCCTGTGCCTGATGACAAGTTCCAAGACCTGGTGGA 
TGCTGTGCGGGCAGAAAAGGGTTTCCTCCTTCTGGCATCCCTGAGGCAGATGAAGAAGACCCGGGGCACG 
CTGCTGGCCCTGGAGCGGAAAGACCACTCTGGCCAGGTCTTCAGCGTGGTGTCCAATGGCAAGGCGGGCA 

CCCTGGACCTCAGCCTGACCGTCCAAGGAAAGCAGCACGTGGTGTCTGTGGAAGAAGCTCTCCTGGCAAC
CGGCCAGTGGAAGAGCATCACCCTGTTTGTGCAGGAAGACAGGGCCCAGCTGTACATCGACTGTGAAAAG 
ATGGAGAATGCTGAGTTGGACGTCCCCATCCAAAGCGTCTTCACCAGAGACCTGGCCAGCATCGCCAGAC 
TCCGCATCGCAAAGGGGGGCGTCAATGACAATTTCCAGGGGGTGCTGCAGAATGTGAGGTTTGTCTTTGG 
AACCACACCAGAAGACATCCTCAGGAACAAAGGCTGCTCCAGCTCTACCAGTGTCCTCCTCACCCTTGAC 

AACAACGTGGTGAATGGTTCCAGCCCTGCCATCCGCACTAACTACATTGGCCACAAGACAAAGGACTTGC
AAGCCATCTGCGGCATCTCCTGTGATGAGCTGTCCAGCATGGTCCTGGAACTCAGGGGCCTGCGCACCAT 
TGTGACCACGCTGCAGGACAGCATCCGCAAAGTGACTGAAGAGAACAAAGAGTTGGCCAATGAGCTGAGG 
CGGCCTCCCCTATGCTATCACAACGGAGTTCAGTACAGAAATAACGAGGAATGGACTGTTGATAGCTGCA 
CTGAGTGTCACTGTCAGAACTCAGTTACCATCTGCAAAAAGGTGTCCTGCCCCATCATGCCCTGCTCCAA

TGCCACAGTTCCTGATGGAGAATGCTGTCCTCGCTGTTGGCCCAGCGACTCTGCGGACGATGGCTGGTCT
CCATGGTCCGAGTGGACCTCCTGTTCTACGAGCTGTGGCAATGGAATTCAGCAGCGCGGCCGCTCCTGCG 
ATAGCCTCAACAACCGATGTGAGGGCTCCTCGGTCCAGACACGGACCTGCCACATTCAGGAGTGTGACAA 
AAGATTTAAACAGGATGGTGGCTGGAGCCACTGGTCCCCGTGGTCATCTTGTTCTGTGACATGTGGTGAT 
GGTGTGATCACAAGGATCCGGCTCTGCAACTCTCCCAGCCCCCAGATGAATGGGAAACCCTGTGAAGGCG

AAGCGCGGGAGACCAAAGCCTGCAAGAAAGACGCCTGCCCCATCAATGGAGGCTGGGGTCCTTGGTCACC
ATGGGACATCTGTTCTGTCACCTGTGGAGGAGGGGTACAGAAACGTAGTCGTCTCTGCAACAACCCCGCA 
CCCCAGTTTGGAGGCAAGGACTGCGTTGGTGATGTAACAGAAAACCAGATCTGCAACAAGCAGGACTGTC 
CAATTGATGGATGCCTGTCCAATCCCTGCTTTGCCGGCGTGAAGTGTACTAGCTACCCTGATGGCAGCTG 
GAAATGTGGTGCTTGTCCCCCTGGTTACAGTGGAAATGGCATCCAGTGCACAGATGTTGATGAGTGCAAA 

GAAGTGCCTGATGCCTGCTTCAACCACAATGGAGAGCACCGGTGTGAGAACACGGACCCCGGCTACAACT
GCCTGCCCTGCCCCCCACGCTTCACCGGCTCACAGCCCTTCGGCCAGGGTGTCGAACATGCCACGGCCAA
CAAACAGGTGTGCAAGCCCCGTAACCCCTGCACGGATGGGACCCACGACTGCAACAAGAACGCCAAGTGC 
AACTACCTGGGCCACTATAGCGACCCCATGTACCGCTGCGAGTGCAAGCCTGGCTACGCTGGCAATGGCA 
TCATCTGCGGGGAGGACACAGACCTGGATGGCTGGCCCAATGAGAACCTGGTGTGCGTGGCCAATGCGAC 
TTACCACTGCAAAAAGGATAATTGCCCCAACCTTCCCAACTCAGGGCAGGAAGACTATGACAAGGATGGA
ATTGGTGATGCCTGTGATGATGACGATGACAATGATAAAATTCCAGATGACAGGGACAACTGTCCATTCC 
ATTACAACCCAGCTCAGTATGACTATGACAGAGATGATGTGGGAGACCGCTGTGACAACTGTCCCTACAA 
CCACAACCCAGATCAGGCAGACACAGACAACAATGGGGAAGGAGACGCCTGTGCTGCAGACATTGATGGA 
GACGGTATCCTCAATGAACGGGACAACTGCCAGTACGTCTACAATGTGGACCAGAGAGACACTGATATGG 


ATGGGGTTGGAGATCAGTGTGACAATTGCCCCTTGGAACACAATCCGGATCAGCTGGACTCTGACTCAGA 
CCGCATTGGAGATACCTGTGACAACAATCAGGATATTGATGAAGATGGCCACCAGAACAATCTGGACAAC 
TGTCCCTATGTGCCCAATGCCAACCAGGCTGACCATGACAAAGATGGCAAGGGAGATGCCTGTGACCACG 
ATGATGACAACGATGGCATTCCTGATGACAAGGACAACTGCAGACTCGTGCCCAATCCCGACCAGAAGGA 
CTCTGACGGCGATGGTCGAGGTGATGCCTGCAAAGATGATTTTGACCATGACAGTGTGCCAGACATCGAT 


GACATCTGTCCTGAGAATGTTGACATCAGTGAGACCGATTTCCGCCGATTCCAGATGATTCCTCTGGACC 
CCAAAGGGACATCCCAAAATGACCCTAACTGGGTTGTACGCCATCAGGGTAAAGAACTCGTCCAGACTGT
CAACTGTGATCCTGGACTCGCTGTAGGTTATGATGAGTTTAATGCTGTGGACTTCAGTGGCACCTTCTTC 
ATCAACACCGAAAGGGACGATGACTATGCTGGATTTGTCTTTGGCTACCAGTCCAGCAGCCGCTTTTATG 
TTGTGATGTGGAAGCAAGTCACCCAGTCCTACTGGGACACCAACCCCACGAGGGCTCAGGGATACTCGGG 

CCTTTCTGTGAAAGTTGTAAACTCCACCACAGGGCCTGGCGAGCACCTGCGGAACGCCCTGTGGCACACA
GGAAACACCCCTGGCCAGGTGCGCACCCTGTGGCATGACCCTCGTCACATAGGCTGGAAAGATTTCACCG 
CCTACAGATGGCGTCTCAGCCACAGGCCAAAGACGGGTTTCATTAGAGTGGTGATGTATGAAGGGAAGAA 
AATCATGGCTGACTCAGGACCCATCTATGATAAAACCTATGCTGGTGGTAGACTAGGGTTGTTTGTCTTC 
TCTCAAGAAATGGTGTTCTTCTCTGACCTGAAATACGAATGTAGAGATCCCTAATCATCAAATTGTTGAT 

TGAAAGACTGATCATAAACCAATGCTGGTATTGCACCTTCTGGAACTATGGGCTTGAGAAAACCCCCAGG
ATCACTTCTCCTTGGCTTCCTTCTTTTCTGTGCTTGCATCAGTGTGGACTCCTAGAACGTGCGACCTGCC 
TCAAGAAAATGCAGTTTTCAAAAACAGACTCATCAGCATTCAGCCTCCAATGAATAAGACATCTTCCAAG 
CATATAAACAATTGCTTTGGTTTCCTTTTGAAAAAGCATCTACTTGCTTCAGTTGGGAAGGTGCCCATTC 
CACTCTGCCTTTGTCACAGAGCAGGGTGCTATTGTGAGGCCATCTCTGAGCAGTGGACTCAAAAGCATTT 

TCAGGCATGTCAGAGAAGGGAGGACTCACTAGAATTAGCAAACAAAACCACCCTGACATCCTCCTTCAGG
AACACGGGGAGCAGAGGCCAAAGCACTAAGGGGAGGGCGCATACCCGAGACGATTGTATGAAGAAAATAT 
GGAGGAACTGTTACATGTTCGGTACTAAGTCATTTTCAGGGGATTGAAAGACTATTGCTGGATTTCATGA 
TGCTGACTGGCGTTAGCTGATTAACCCATGTAAATAGGCACTTAAATAGAAGCAGGAAAGGGAGACAAAG 
ACTGGCTTCTGGACTTCCTCCCTGATCCCCACCCTTACTCATCACCTTGCAGTGGCCAGAATTAGGGAAT 

CAGAATCAAACCAGTGTAAGGCAGTGCTGGCTGCCATTGCCTGGTCACATTGAAATTGGTGGCTTCATTC
TAGATGTAGCTTGTGCAGATGTAGCAGGAAAATAGGAAAACCTACCATCTCAGTGAGCACCAGCTGCCTC 
CCAAAGGAGGGGCAGCCGTGCTTATATTTTTATGGTTACAATGGCACAAAATTATTATCAACCTAACTAA 
AACATTCCTTTTCTCTTTTTTCCGTAATTACTAGGTAGTTTTCTAATTCTCTCTTTTGGAAGTATGATTT 
TTTTAAAGTCTTTACGATGTAAAATATTTATTTTTTACTTATTCTGGAAGATCTGGCTGAAGGATTATTC 

ATGGAACAGGAAGAAGCGTAAAGACTATCCATGTCATCTTTGTTGAGAGTCTTCGTGACTGTAAGATTGT
AAATACAGATTATTTATTAACTCTGTTCTGCCTGGAAATTTAGGCTTCATACGGAAAGTGTTTGAGAGCA 
AGTAGTTGACATTTATCAGCAAATCTCTTGCAAGAACAGCACAAGGAAAATCAGTCTAATAAGCTGCTCT 
GCCCCTTGTGCTCAGAGTGGATGTTATGGGATTCCTTTTTTCTCTGTTTTATCTTTTCAAGTGGAATTAG 
TTGGTTATCGATTTGCAAATGTTTTAAATTGC^^ 

TCCCTTGTGCATATTTCCAGGGAGAAGGAAAGCATATACACTTTTTTCTTTCATTTTTCCAAAAGAGAAA
AAAATGACAAAAGGTGAAACTTACATACAAATATTACCTCATTTGTTGTGTGACTGAGTAAAGAATTTTT 
GGATCAAGCGGATU^GAGTTTAAGTGTCTAACAAACTTAAAGCTACTGTAGTACCTAAAAAGTCAGTGTTG 
TACATAGCATAAAAACTCTGCAGAGAAGTATTCCCAATAAGGAAATAGCATTGAAATGTTAAATACAATT 
TCTGAAAGTTATGTTTTTTTTCTATCATCTGGTATACCATTGCTTTATTTTTATAAATTATTTTCTCATT 

GCCATTGGAATAGAATATTCAGATTGTGTAGATATGCTATTTAAATAATTTATCAGGAAATACTGCCTGT
AGAGTTAGTATTTCTATTTTTATATAATGTTTGCACACTGAATTGAAGAATTGTTGGTTTTTTCTTTTTT 
TTGTTTTTTTTTTTTTTTT^^ 

CTAGGAATGTGCTTTTTTTTGTACACATTTTTATCCATTTTACATTCTAAAGCAGTGTAAGTTGTATATT 
ACTGTTTCTTATGTACAAGGAACAACAATAAATCATATGGAAATTTATATTT 

SEQ ID NO:482

>gi|4507485|ref|NP_003237.1| thrombospondin 1 [Homo sapiens]
MGLAWGLGVLFLMHVCGTNRIPESGGDNSVFDIFELTC 
PDDKPQDLVDAVRAEKGFLLLASLRQMKJCTRGTLLA^ 
VSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELDVPIQSVFTRDLASIARLRIAKGGVNDNFQG
VLQNVRFVFGTTPEDILRNKGCSSSTSVLLT^
VLELRGLRTIVTTLQDSIRKVTEENKEL

VSCPIMPCSNATVPDGECCPRCWPSDSADDGWSPWSEWTSCSTSCGNGIQQRGRSCDSLNNRCEGSSVQT 
RTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACP 
INGGWGPWSPWDICSVTCGGGVQKRSRLCNNPAPQFGGKDCVGDVTENQICNKQDCPIDGCLSNPCFAGV 
KCTSYPDGSWKCGACPPGYSGNGIQCTDVDECKEVPDACFNHNGEHRCENTDPGYNCLPCPPRFTGSQPF
GQGVEHATANKQVCKPRNPCTDGTHDCNKNAKCNYLGHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPN
ENLVCVANATYHCKKDNCPNLPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDV
GDRCDNCPYNHNPDQADTDNNGEGDACAADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEH
NPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDNDGIPDDKDNC
RLVPNPDQKDSDGDGRGDACKDDFDHDSVPDIDDICPENVDISETDFRRFQMIPLDPKGTSQNDPNWVVR
HQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDDDYAGFVFGYQSSSRFYVVMWKQVTQSYWDT
NPTRAQGYSGLSVKVVNSTTGPGEHLRNALWHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGF
IRVVMYEGKKIMADSGPIYDKTYAGGRLGLFVFSQEMVFFSDLKYECRDP

SEQ ID NO:483
Transgelin 2

>GL3_30H_PCR_G3F1 Direction: sense

CACTGTACCCGAGGGGCAGGCCCCAGTAAAGAAGATCAGGCCTCCACCATGGCCTTCAAGCAGATGGAGCAGATC 
TCTCAGTTCCTG

SEQ ID NO:484

>gi|4507356|ref|NM_003564.1| Homo sapiens transgelin 2 (TAGLN2), mRNA

GCCCTTGCCTTGAGTCAGTGCGCTGCTCTCCAGCCCGCTTGAACGCTCCCCGCAGCCACCGCCACCCATT 

GGAATGGCCAACAGGGGACCTGCATATGGCCTGAGCCGGGAGGTGCAGCAGAAGATTGAGAAACAATATG 

ATGCAGATCTGGAGCAGATCCTGATCCAGTGGATCACCACCCAGTGCCGAAAGGATGTGGGCCGGCCCCA 

GCCTGGACGCGAGAACTTCCAGAACTGGCTCAAGGATGGCACGGTGCTATGTGAGCTCATTAATGCACTG 

TACCCCGAGGGGCAGGCCCCAGTAAAGAAGATCCAGGCCTCCACCATGGCCTTCAAGCAGATGGAGCAGA 

TCTCTCAGTTCCTGCAAGCAGCTGAGCGCTATGGCATTAACACCACTGACATCTTCCAAACTGTGGACCT 

CTGGGAAGGAAAGAACATGGCCTGTGTGCAGCGGACGCTGATGAATCTGGGTGGGCTGGCAGTAGCCCGA 

GATGATGGGCTCTTCTCTGGGGATCCCAACTGGTTCCCTAAGAAATCCAAGGAGAATCCTCGGAACTTCT 

CAGATAACCAGCTGCAAGAGGGCAAGAACGTGATCGGGTTACAGATGGGCACCAACCGCGGGGCGTCTCA 

GGCAGGCATGACTGGCTACGGGATGCCACGCCAGATCCTCTGATCCCACCCCAGGCCTTGCCCCTGCCCT 

CCCACGAATGGTTAATATATATGTAGATATATATTTTAGCAGTGACATTCCCAGAGAGCCCCAGAGCTCT 

CAAGCTCCTTTCTGTCAGGGTGGGGGGTTCAGCCTGTCCTGTCACCTCTGAGGTGCCTGCTGGCATCCTC 

TCCCCCATGCTTACTAATACATTCCCTTCCCCATAGCCATCAAAACTGGACCAACTGGCCTCTTCCTTTC 

CCCTGGGACCAAAATTTAGGGGCCTCAGTCCCTCACCGCCATGCCCTGGCCTATTCTGTCTCTCCTTCTT 

CCCCCTGGCCTGTTCTGTCTCTGAGCTCTGTGTCCTCCGTTCATTCCATGGCTGGGAGTCACTGATGCTG 

CCTCTGCCTTCTGATGCTGGACTGGCCTTGCTTCTACAAGTATGCTTCTCCCACAGCTGTGGCTGCAGGA 

ACTTAATTTATAGGGAGGAGCCTGTGGCAGCTGCTGCCCCAGCCACAGCTGCACTGACTGTGCTCACCAC 

ACATCTGGGGC^GCCTTCCCTGGCAGGGGCCCTCGTGGCTTCTCATTTTCCATTCCCTTCACTGTGGCTA 

AGGGGTGGGGTGAGGGGATGGAGAGGGAGGGCTGCCTACCATGGTCTGGGGCTTGAGGAAGATGAGTTTG 
TTGATTTAAATAAAGAATTTGTCATTTTTG 

SEQ ID NO:485

>gi|4507357|ref|NP_003555.1| transgelin 2; SM22-alpha homolog [Homo
sapiens]

MANRGPAYGLSREVQQKIEKQYDADLEQ

PEGQAPVKKIQASTMAFKQMEQISQFLQAAERYGINTTDIFQTVDLWEGKNMACVQRTLMNLGGLAVARD
DGLFSGDPNWFPKKSKENPRNFSDNQLQEGKNVIGLQMGTNRGASQAGMTGYGMPRQIL

SEQ ID NO:486
Vigilin

>GL3_43A1_1_M13F Direction: anti-sense

CCGGGCAGCTTCGCAGTTGGCTCGCTTGCCTGTGATGATAATGGTCTCTGAATTGCTATTCTCTGCTGGAAGGGT 
CGATTTTGGTGTTGCTTTCTTCACGAATCTTTTTAATGTTTGCGCCTCCTTTCCCAATGATATTCTTGTGAAACT 
GTTTGAAGATCGGAACAGAAATTGAATAGCTATTTTCCACCAGATCTGCCACCATCTTCTGCATGTATTTTGTGC 
ATTTTTCCACCTCATTCTTAGGTCCTCTGAGCTGGACAATGTCACTTTTTTGTGCTGGG

SEQ ID NO:487

>GL3_43A1_1_M13R Direction: sense 

CCCAGCACAAAAAAGTGACATTGTCCAGCTCAGAGGACCTAAGAATGAGGTGGAAAAATGCACAAAATACATGCA 
GAAGATGGTGGCAGATCTGGTGGAAAATAGCTATTCAATTTCTGTTCCGATCTTCAAACAGTTTCACAAGAATAT 
CATTGGGAAAGGAGGCGCAAACATTAAAAAGATTCGTGAAGAAAGCAACACCAAAATCGACCTTCCAGCAGAGAA
TAGCAATTCAGAGACCATTATCATCACAGGCAAGCGAGCCAACTGCGAAGCTGCCCGG 

SEQ ID NO:488 

>GL3_43A1_2_M13F Direction: sense 

CCCAGGACAAAAAAGTGACATTGTCCAGCTCAGAGGACCTAAGAATGAGGTGGAAAAATGCACAAAATACATGCA
GAAGATGGTGGCAGATCGTGGTGGAAAATAGCTATTCAATTTGCTGTTCCGATTCTTCAAACAGTTTCACAAGAA 
TATCATTGGGAAAGGAGGCGCAAACGATTCAAGAAAGATTCGTGAAGAAAGCAACACCAAAATCGACCTTCCAGC 
AGAGAATAGCAATTTCAGAGACCATTATCAGTCACAGGCAAGCGAGCCAACTGCGAAGGGTTGCCCTGGTCCGCT 
AGAAACCCAGCACAGGTGGTTAGATGGATACAAGCGGCCGCTCGACTAGTCTGAGGTCTGATACTTCACTGACTA 

GTCGTTAAGGGCGAATTCCAGCACACTGGCGGCCGTCGACTAGTGGATCCGAGCTC

SEQ ID NO:489

>GL3_43A1_2_M13R Direction: anti-sense

CCGGGCAGCTTCGCAGTTGGCTCGCTTGCCTGTGATGATAATGGTCTCTGAATTGCTATTCTCTGCTGGAAGGTC
GATTTTGGTGTTGCTTTCTTCACGAATCTTTTTAATGTTTGCGCCTCCTTTCCCAATGATATTCTTGTGAAACTG
TTTGAAGATCGGAACAGAAATTGAATAGCTATTTTCCACCAGATCTGCCACCATCTTCTGCATGTATTTTGTGCA 
TTTTTCCACCTCATTCTTAGGTCCTCTGAGCTGGACAATGTCACTTTTTTGTGCTGGG 

SEQ ID NO:490

>GL3_43A1_4_M13F Direction: sense

CCCAGCACAAAAAAGTGACATTGTCCAGCTCAGAGGACCTAAGAATGAGGTGGAAAAATGCACAAAATACATGCA 
GAAGATGGTGGCAGATCTGGTGGAAAATAGCTATTCAATTTCTGTTCCGATCTTCAAACAGTTTCACAAGAATAT 
CATTGGGAAAGGAGGCGCAAACATTAAAAAGATTCGTGAAGAAAGCAACACCAAAATCGACCTTCCAGCAGAGAA 
TAGCAATTCAGAGACCATTATCATCACAGGCAAGCGAGCCAACTGCGAAGTTGCCCGGCCGCAGAAACCCAGCAC 

AGTGGTTAGATAGATAAAGCGGCCGCTCGACTAGTCTGAGGTCTGATACTCACTGACTGTCGTAAGGGCGAATTC
CAGCACACTGGCGGCCGTTACTAGTGGATCCGAGCTCGGTACCAAGCTTGGCGTAATCATGGT 

SEQ ID NO:491

>GL3_43A1_4_M13R Direction: anti-sense
CCGGGCAGCTTCGCAGTTGGCTCGCTTGCCTGTGATGATAATGGTCTCTGAATTGCTATTCTCTGCTGGAAGGTC
GATTTTGGTGTTGCTTTCTTCACGAATCTTTTTAATGTTTGCGCCTCCTTTCCCAATGATATTCTTGTGAAACTG 
TTTGAAGATCGGAACAGAAATTGAATAGCTATTTTCCACCAGATCTGCCACCATCTTCTGCATGTATTTTGTGCA 
TTTTTCCACCTCATTCTTAGGTCCTCTGAGCTGGACAATGTCACTTTTTTGTGCTGGG 

SEQ ID NO:492

>GL3_43A_PCR_G3F1 Direction: sense 

CCCAGCACAAAAAAGTGACATTGTCCAGCTCAGAGGACCTAAGAATGAGGTGGAAAAATGCACAAAATACATGCA 
GAAGATGGTGGCAGATCTGGTGGAAAATAGCTATTCAATTTCTGTTCCGATCTTCAAACAGTTTCACAAGAATAT 
CATTGGGAAAGGAGGCGCAAACATTAAAAAGATTCGTGAAGAAAGCAACACCAAAATCGACCTTCCAGCAGAGAA 
TAGCAAGTTCAGAGACCATTATCATCACAGGCAAGCGAGCCAACTGCGAAGCTGCCCGG

SEQ ID NO:493

>gi|4885408|ref|NM_005336.1| Homo sapiens high density lipoprotein binding
protein (vigilin) (HDLBP), mRNA

GAATTCGGGGGGCGAGTAAGCCAGCGGCAGGACCAGCGGGCGGGGGCCACAACAAAAGCTGGCAGGCTGA
CAGAGGCGGCCTCAGGACGGACCTTCTGGCTACTGACCGTTTTGCTGTGGTTTTCCCGGATTGTGTGTAG 
GTGTGAGATCAACCATGAGTTCCGTTGCAGTTTTGACCCAAGAGAGTTTTGCTGAACACCGAAGTGGGCT 
GGTTCCGCAACAAATC^U^GTTGCCACTCTAAATTCAGAAGAGGAGAGCGACCCTCCAACCTACAAGGAT 
GCCTTCCCTCCACTTCCTGAGAAAGCTGCTTGCCTGGAAAGTGCCCAGGAACCCGCTGGAGCCTGGGGGA 

ACAAGATCCGACCCATCAAGGCTTCTGTCATCACTCAGGTGTTCCATGTACCCCTGGAGGAGAGAAAATA
CAAGGATATGAACCAGTTTGGAGAAGGTGAACAAGCAAAAATCTGCCTTGAGATCATGCAGAGAACTGGT 
GCTCACTTGGAGCTGTCTTTGGCCAAAGACCAAGGCCTCTCCATCATGGTGTCAGGAAAGCTGGATGCTG 
T(^TGAAAGCTCGGAAGGACATTGTTGCTAGACTGCAGACTCAGGCCTCAGCAACTGTTGCC71TTCCCAA 
AGAACACCATCGCTTTGTTATTGGCAAAAATGGAGAGAAACTGCAAGACTTGGAGCTAAAAAC 

AAAATCCAGATCCCACGCCCAGATGACCCCAGCAATCAGATCAAGATCACTGGCACCAAAGAGGGCATCG
AGAAAGCTCGCCATGAAGTCTTACTCATCTCTGCCGAGCAGGACAAACGTGCTGTGGAGAGGCTAGAAGT
AGAAAAGGCATTCCACCCCTTCATCGCTGGGCCGTATAATAGACTGGTTGGCGAGATCATGCAGGAGACA 
GGCACGCGCATCAACATCCCCCCACCCAGCGTGAACCGGACAGAGATTGTCTTCACTGGAGAGAAGGAAC 
AGTTGGCTCAGGCTGTGGCTCGCATCAAGAAGATTTATGAGGAGAAGAAAAAGAAGACTACAACCATTGC 
AGTGGAAGTGAAGAAATCCCAACACAAGTATGTCATTGGGCCCAAGGGCAATTCATTGCAGGAGATCCTT
GAGAGAACTGGAGTTTCCGTTGAGATCCCACCCTCAGACAGCATCTCTGAGACTGTAATACTTCGAGGCG 
AACCTGAAAAGTTAGGTCAGGCGTTGACTGAAGTCTATGCCAAGGCCAATAGCTTCACCGTCTCCTCTGT 
CGCCGCCCCTTCCTGGCTTCACCGTTTCATCATTGGCAAGAAAGGGCAGAACCTGGCCAAAATCACTCAG 
CAGATGCCAAAGGTTCACATCGAGTTCACAGAGGGCGAAGACAAGATCACCCTGGAGGGCCCTACAGAGG 

ATGTCAATGTGGCCCAGGAACAGATAGAAGGCATGGTCAAAGATTTGATTAACCGGATGGACTATGTGGA
GATCAACATCGACCACAAGTTCCACAGGCACCTCATTGGGAAGAGCGGTGCCAACATAAACAGAATCAAA 
GACCAGTACAAGGTGTCCGTGCGCATCCCTCCTGACAGTGAGAAGAGCAATTTGATCCGCATCGAGGGGG 
ACCCACAGGGCGTGCAGCAGGCCAAGCGAGAGCTGCTGGAGCTTGCATCTCGCATGGAAAATGAGCGTAC 
CAAGGATCTAATCATTGAGCAAAGATTTCATCGCACAATCATTGGGCAGAAGGGTGAACGGATCCGTGAA 

ATTCGTGACAAATTCCCAGAGGTCATCATTAACTTTCCAGACCCAGCACAAAAAAGTGACATTGTCCAGC
TCAGAGGACCTAAGAATGAGGTGGAAAAATGCACAAAATACATGCAGAAGATGGTGGCAGATCTGGTGGA 
AAATAGCTATTCAATTTCTGTTCCGATCTTCAAACAGTTTCACAAGAATATCATTGGGAAAGGAGGCGCA 
AACATTAAAAAGATTCGTGAAGAAAGCAACACCAAAATCGACCTTCCAGCAGAGAATAGCAATTCAGAGA 
CCATTATCATCACAGGCAAGCGAGCCAACTGCGAAGCTGCCCGGAGCAGGATTCTGTCTATTCAGAAAGA 

CCTGGCCAACATAGCCGAGGTAGAGGTCTCCATCCCTGCCAAGCTGCACAACTCCCTCATTGGCACCAAG
GGCCGTCTGATCCGCTCCATCATGGAGGAGTGCGGCGGGGTCCACATTCACTTTCCCGTGGAAGGTTCAG 
GAAGCGACACCGTTGTTATCAGGGGCCCTTCCTCGGATGTGGAGAAGGCCAAGAAGCAGCTCCTGCATCT 
GGCGGAGGAGAAGCAAACCAAGAGTTTCACTGTTGACATCCGCGCCAAGCCAGAATACCACAAATTCCTC 
ATCGGCAAGGGGGGCGGCAAAATTCGCAAGGTGCGCGACAGCACTGGAGCACGTGTCATCTTCCCTGCGG 

CTGAGGACAAGGACCAGGACCTGATCACCATCATTGGAAAGGAGGACGCCGTCCGAGAGGCACAGAAGGA
GCTGGAGGCCTTGATCCAAAACCTGGATAATGTGGTGGAAGACTCCATGCTGGTGGACCCCAAGCACCAC 
CGCCACTTCGTCATCCGCAGAGGCCAGGTCTTGCGGGAGATTGCTGAAGAGTATGGCGGGGTGATGGTCA 
GCTTCCCACGCTCTGGCACACAGAGCGACAAAGTCACCCTCAAGGGCGCCAAGGACTGTGTGGAGGCAGC 
CAAGAAACGCATTCAGGAGATCATTGAGGACCTGGAAGCTCAGGTGACATTAGAATGTGCTATACCCCAG 

AAATTCCATCGATCTGTCATGGGCCCCAAAGGTTCCAGAATCCAGCAGATTACTCGGGATTTCAGTGTTC
AAATTAAATTCCCAGACAGAGAGGAGAACGCAGTTCACAGTACAGAGCCAGTTGTCCAGGAGAATGGGGA 
CGAAGCTGGGGAGGGGAGAGAGGCTAAAGATTGTGACCCCGGCTCTCCAAGGAGGTGTGACATCATCATC 
ATCTCTGGCCGGAAAGAAAAGTGTGAGGCTGCCAAGGAAGCTCTGGAGGCATTGGTTCCTGTCACCATTG 
AAGTAGAGGTGCCCTTTGACCTTCACCGTTACGTTATTGGGCAGAAAGGAAGTGGGATCCGCAAGATGAT 

GGATGAGTTTGAGGTGAACATACATGTCCCGGCACCTGAGCTGCAGTCTGACATCATCGCCATCACGGGC
CTCGCTGCAAATTTGGACCGGGCCAAGGCTGGACTGCTGGAGCGTGTGAAGGAGCTACAGGCCGAGCAGG 
AGGACCGGGCTTTAAGGAGTTTTAAGCTGAGTGTCACTGTAGACCCCAAATACCATCCCAAGATTATCGG 
GAGAAAGGGGGCAGTAATTACCCAAATCCGGTTGGAGCATGACGTGAACATCCAGTTTCCTGATAAGGAC 
GATGGGAACCAGCCCCAGGACCAAATTACCATCACAGGGTACGAAAAGAACACAGAAGCTGCCAGGGATG 

CTATACTGAGAATTGTGGGTGAACTTGAGCAGATGGTTTCTGAGGACGTCCCGCTGGACCACCGCGTTCA
CGCCCGCATCATTGGTGCCCGCGGCAAAGCCATTCGCAAAATCATGGACGAATTCAAGGTGGACATTCGC 
TTCCCACAGAGCGGAGCCCCAGACCCCAACTGCGTCACTGTGACGGGGCTCCCAGAGAATGTGGAGGAAG 
CCATCGACCACATCCTCAATCTGGAGGAGGAATACCTAGCTGACGTGGTGGACAGTGAGGCGCTGCAGGT 
ATACATGAAACCCCCAGCACACGAAGAGGCCAAGGCACCTTCCAGAGGCTTTGTGGTGCGGGACGCACCC 

TGGACCGCCAGCAGCAGTGAGAAGGCTCCTGACATGAGCAGCTCTGAGGAATTTCCCAGCTTTGGGGCTC
AGGTGGCTCCCAAGACCCTCCCTTGGGGCCCCAAACGATAATGATCAAAAAGAACAGAACCCTCTCCAGC 
CTGCTGACCCGAACCCAACCACACAATGGTTTGTCTCAATCTGACCCAGCGGCTGGACCCTCCGTAAATT 
GTTGAGCGCTCTTCCCCTTCCCGAGGTCCGCAGGGAGCCTAGCGCCTGGCTGTGTGTGCGGCCGCTCCTC 
CAGGCCTGGCCGTGCCCGCTCAGGACCTGCTCCACTGTTTAACAATAAACCAAGGTCATGAGCATTCGAG 

CTAAGATAACAGACTCCAGCTCCTGGTCCACCCGGCATGTCAGTCAGCACTCTGGCCTTCATCACGAGAG
CTCCGCAGCCGTGGCTAGGATTCCACTTCCTGTGTCATGACCTCAGGAAATAAACGTCCTTGACTTTATA 
AAAGCCCCGAATTC 

SEQ ID NO:494 

>gi|4885409|ref|NP_005327.1| high density lipoprotein binding protein;
vigilin [Homo sapiens]

MSSVAVLTQESFAEHRSGLVPQQIKVATLNSEEESDPPTYKDAFPPLPEKAACLESAQEPAGAWGNKIRP 
IKASVITQVFHVPLEERKYKDMNQFGEGEQAICI^ 

KDIVARLQTQASATVAIPKEHHRFVIGKNGEKLQDLELKTATKIQIPRPDDPSNQIKITGTKEGIEKARH
EVLLISAEQDKRAVERLEVEKAFHPFIAGPYNRLVGEIMQETGTRINIPPPSVNRTEIVFTGEKEQLAQA
VARIKKIYEEKKKKTTTIAVEVKKSQHKYVIGPK^^ 
GQALTEVYAKANSFTVSSVAAPSWLHRFIIGKKGQNLAKITQQMPKVHIEFTEGEDKITLEGPTEDVKVA
QEQIEGMVKDLINRMDYVEINIDH^ 

QQAKRELLELASRMENERTKDLIIEQRFHRTIIGQKGERIREIRDKFPEVIINFPDPAQKSDIVQLRGPK
NEVEKCTKYMQKMVADLVENSYSISVPIFKQFHKNIIGKGGANIKKIREESNTKIDLPAENSNSETIIIT
GKRANCEAARSRILSIQKDLANIAEVEVSIPAKLHNSLIGTKGRLIRSIMEECGGVHIHFPVEGSGSDTV
VIRGPSSDVEKAKKQLLHLAEEKQTKSFTVD^

QDLITIIGKEDAVREAQKELEALIQNLDNVVEDSMLVDPKHHRHFVIRRGQVLREIAEEYGG
GTQSDKVTLKGAKDCVEAAKKRIQEIIEDLEAQVTLECAIPQKFHRSVMGPKGSRIQQITRDFSVQIKFP
DREENAVHSTEPVVQENGDEAGEGREAKDCDPGSPRRCDIIIISGRKEKCEAAKEALEALVPVTIEVEVP
FDLHRYVIGQKGSGIRKMMDEFEVNIHVPAPELQSDIIAITGLAANLDRAKAGL

RSFKLSVTVDPKYHPKIIGRKGAVITQIRLEHDVNIQFPDKDDGNQPQDQITITGYEKNTEAARDAILRI

VGELEQMVSEDVPLDHRVHARI IGARGKAIRKIiy^ 

k^EEEYLADWDSEALQVYMKPPAHEEAK^ 

TLPWGPKR 



Vimentin 

>G3_8_04_PCR_G3F1 Direction: N/A 

GCCGTCCAGGCCATCGCCACCCTCCGCAGCCATGTCCACCAGGTCCGTGTCCTCGTCCTCCTACCGCAGATGTTC 
GGCGGCCCGGGCACCGCGAGCCGGGCCGAGCTCCAGCCGGAGCTACGTGACTACGTCCACCCGCACCTACAGCCT
GGGCAGCGCGCTGCGCCCCAGCACCAGCGCAGCCTCTACGCCTCGTCCCCGGGCGGCGTGTATGCCACGCGCTCC 
TCTGCCGTGCGCCTGCGGAGCAGCGTGCCCGGGGTGCGGCTCCTGCAGGACTCGGTGGACTTCTCGCTGGCCGAC 
GCCATCAACACCGAGTTCAAGAACACCCGCACCAACAGAAGGTGGAGCTGCAGGAGCTGAATGACCGCTTCGCCA 
CTATATCGACAGGTGCGCTTCCTGGAGCAGCAGAATAAGATCCTGCTGGCCGAGCTCGAGCAGCTCAAGGGCCAA 
GGCAAGTCGCGCCTGGGGGACCTCTACGAGGAGGAGATGCGGGAGCTGCGCCGGC

SEQ ID NO:496 

>gi|4507894|ref|NM_003380.1| Homo sapiens vimentin (VIM), mRNA
GGGCGCGCCAGAGACGCAGCCGCGCTCCCACCACCCACACCCACCGCGCCCTCGTTCGCCTCTTCTCCGG 


GAGCCAGTCCGCGCCACCGCCGCCGCCCAGGCCATCGCCACCCTCCGCAGCCATGTCCACCAGGTCCGTG 
TCCTCGTCCTCCTACCGCAGGATGTTCGGCGGCCCGGGCACCGCGAGCCGGCCGAGCTCCAGCCGGAGCT 
ACGTGACTACGTCCACCCGCACCTACAGCCTGGGCAGCGCGCTGCGCCCCAGCACCAGCCGCAGCCTCTA 
CGCCTCGTCCCCGGGCGGCGTGTATGCCACGCGCTCCTCTGCCGTGCGCCTGCGGAGCAGCGTGCCCGGG 
GTGCGGCTCCTGCAGGACTCGGTGGACTTCTCGCTGGCCGACGCCATCAACACCGAGTTCAAGAACACCC 


GCACCAACGAGAAGGTGGAGCTGCAGGAGCTGAATGACCGCTTCGCCAACTACATCGACAAGGTGCGCTT 
CCTGGAGCAGCAGAATAAGATCCTGCTGGCCGAGCTCGAGCAGCTCAAGGGCCAAGGCAAGTCGCGCCTA
GGGGACCTCTACGAGGAGGAGATGCGGGAGCTGCGCCGGCAGGTGGACCAGCTAACCAACGACAAAGCCC 
GCGTCGAGGTGGAGCGCGACAACCTGGCCGAGGACATCATGCGCCTCCGGGAGAAATTGCAGGAGGAGAT 
GCTTCAGAGAGAGGAAGCCGAAAACACCCTGCAATCTTTCAGACAGGATGTTGACAATGCGTCTCTGGCA 


CGTCTTGACCTTGAACGCAAAGTGGAATCTTTGCAAGAAGAGATTGCCTTTTTGAAGAAACTCCACGAAG 
AGGAAATCCAGGAGCTGCAGGCTCAGATTCAGGAACAGCATGTCCAAATCGATGTGGATGTTTCCAAGCC 
TGACCTCACGGCTGCCCTGCGTGAGGTACGTCAGCAATATGAAAGTGTGGCTGCCAAGAACCTGCAGGAG 
GCAGAAGAATGGTACAAATCCAAGTTTGCTGACCTCTCTGAGGCTGCCAACCGGAACAATGACGCCCTGC 
GCCAGGCAAAGCAGGAGTCCACTGAGTACCGGAGACAGGTGCAGTCCCTCACCTGTGAAGTGGATGCCCT 


TAAAGGAACCAATGAGTCCCTGGAACGCCAGATGCGTGAAATGGAAGAGAACTTTGCCGTTGAAGCTGCT 
AACTACCAAGACACTATTGGCCGCCTGCAGGATGAGATTCAGAATATGAAGGAGGAAATGGCTCGTCACC 
TTCGTGAATACCAAGACCTGCTCAATGTTAAGATGGCCCTTGACATTGAGATTGCCACCTACAGGAAGCT 
GCTGGAAGGCGAGGAGAGCAGGATTTCTCTGCCTCTTCCAAACTTTTCCTCCCTGAACCTGAGGGAAACT 
AATCTGGATTCACTCCCTCTGGTTGATACCCACTCAAAAAGGACATTCCTGATTAAGACGGTTGAAACTA 

GAGATGGACAGGTTATCAACGAAACTTCTCAGCATCACGATGACCTTGAATAAAAATTGCACACACTCAG
TGGCAGGCGATATATTACCCAGGCAAGAATAAAAAAGAAATCCCATATCTTAAAGAAACAGCTTTCAAGT 
GCCTTTCTGCAGTTTTTCAGGAGCGCAAGATAGATTTGGAATAGGAATAAGCTCTAGTTCTTAACAACCG 
ACACTCCTACAAGATTTAGAAAAAAGTTTACAACATAATCTAGTTTACAGAAAAATCTTGTGCTAGAATA 
CTTTTTAAAAGGTATTTTGAATACCATTAAAACTGCTTTT^^ 

TTCTGCTTCAATAAATCTTTGGAAAACTCCA

SEQ ID NO:497 

>gi|4507895|ref|NP_003371.1| vimentin [Homo sapiens]

MSTRSVSSSSYRRMFGGPGTASRPSSSRSYVTTSTRTYSLGSALRPSTSRSLYASSPGGVYATRSSAVRL
RSSVPGVRLLQDSVDFSLADAINTEFKNTRTI^
ACCAGTACAGGCCCCAGAACGAAGACGAGCTGGAGCTGCGCGAGGGGGACAGGGTGGATGTCATGCAGCAGTGTG
ACGATGGCTGGTTTGTGGGTGTCTCCCGGAGGACCCAGAAATTCGGAACGTTCCCTGGAAATTACGTTGCCCCGG 
TGTGAGTGGTCTCCATGGCAACTTGGAGCCAGCCAGGATGGGGTGGGGAGCGGTGGCACTCGTGGGAGGGAGAGG 
ACCCCCGCCCACATCCTCCTTCCCCAGGACCTGAGCTCCCAGCATCTGCAGACGACCCCCGCAGCCTTTCCCTCG 
GACCCCCCTCGAAGCCCCCTGGACTGATTCCCACCACGATC 

SEQ ID NO:508 

>GL3_15J_2_M13F Direction: anti-sense

ACCAGTACAGGCCCCAGAACGAAGACGAGCTGGAGCTGCGCGAGGGGGACAGGGGTGGATGTCATGCAGCAGTGT 

GACGATGGCTGGTTTGTGGGTGTCTCCCGGAGGACCCAGAAATTCGGAACGTTCCCTGGAAATTACGTTGCCCCG 

GTGTGAGTGGTCTCCATGGCAACTTGGAGCCAGCCAGGATGGGGTGGGGAGCGGTGGCACTCGTGGGAGGGAGAG 

GACCCCCGCCCACATCCTCCTTCCCCAGGACCTGAGCTCCCAGCATCTGCAGACGACCCCCGCAGCCTTTCCCTC 
GGACCCTCGAA 

SEQ ID NO:509 

>GL3_15J_2_M13R Direction: sense 

CTTGGGGTCTGGTCTCAATAAGGCCTGGAAATGGTCTCCGCTTCAAAGCAGACCTCTGTATTTGGGGAGTGGGGT 

GGGGAGGAAATGAAAGGGCTGTGGGAGGAATGCCTGTGAGTCGTGGGTGGGAATCAGTCCAGGGGGCTTCGAGGG 

GGGTCCGAGGGAAAGGCTGCGGGGGTCGTCTGCAGATGCTGGGAGCTCAGGTCCTGGGGAAGGAGGATGTGGGCG 

GGGGTCCTCTCCCTCCCACGAGTGCCACCGCTCCCCACCCCATCCTGGCTGGCTCCAAGTTGCCATGGAGACCAC 

TCACACCGGGGCAACGTAATTTCCAGGGAACGTTCCGAATTTCTGGGTCCTCCGGGAGACACCCACAAACCAGCC 

ATCGTCACACTGCTGCATGACATCCACCCTGTCCCCCTCGCGCAGCTCCAGCTCGTCTTCGTTCTGGGGCCTGTA 
CTGGT 

SEQ ID NO:510

>GL3_15J_3_M13F Direction: sense

CTTGGGGTCTGGTCTCAATAAGGCCTGGAAATGGTCTCCGCTTCAAAGCAGACCTCTGTATTTGGGGAGTGGGGT 

GGGGAGGAAATGAAAGGGCTGTGGGAGGAATGCCTGTGAGTCGTGGGTGGGAATCAGTCCAGGGGGCTTCGAGGG 

GGGTCCGAGGGAAAGGCTGCGGGGGTCGTCTGCAGATGCTGGGAGCTCAGGTCCTGGGGAAGGAGGATGTGGGCG 
GGGGTCCTCTCCCTCCCACGAGTGC 

SEQ ID NO:511

>GL3_15J_4_M13F Direction: N/A

CTTGGGGTCTGGTCTCAATAAGGCCTGGAAATGGTCTCCGCTTCAAAGCAGACCTCTGTATTTGGGGAGTGGGGT 

GGGGAGGAAATGAAAGGGCTGTGGGAGGAATGCCTGTGAGTCGTGGGTGGGAATCAGTCCAGGGGGCTTCGAGGG 

GGGTCCGAGGGAAAGGCTGCGGGGGTCGTCTGCAGATGCTGGGAGCTCAGGTCCTGGGGAAGGAGGATGTGGGCG 

GGGGTCCTCTCCCTCCCACGAGTGCCACCGTCCCCACCCCATCCTGGCTGGCTCCAAGTTGCCATGGAGACCACT 

CACACCGGGGCAACGTAATTTCCAGGGAACGTTCCGAATTTCTGGGTCCTCCGGGAGACACCCACAAACCAGCCA 
TCGTCACAC 

SEQ ID NO:512

>GL3_15J_PCR_G3F1 Direction: N/A

CTTGGGGTCTGGTCTCAATAAGGCCTGGAAATGGTCTCCGCTTCAAAGCAGACCTCTGTATTTGGGGAGTGGGGT 

GGGGAGGAAATGAAAGGGCTGTGGGAGGAATGCCTGTGAGTCGTGGGTGGGAATCAGTCCAGGGGGCTTCGAGGG 

GGGTCCGAGGGAAAGGCTGCGGGGGTCGTCTGCAGATGCTGGGAGCTCAGTCCTGGGGAAGGAGGATGTGGGCGG 

GGGTCCTCTCCCTCCCACAGTGCCACGTCCCCCCACCCATCCCTGGGCGTGGCTCCAAGTTGCCAGTGGAAGAAC 

ACTCACCACCCGGGGCCAACCGGTAATTTCCAAGGGAACCGGTTCCCGAAATTTCCTGGGGTCCCTCCGGGAAAA 
AACCAACAAAACAAGCCCTACGGCA 

SEQ ID NO:513

>GL3_37B_PCR_G3F1 Direction: sense 

CTTGGGGTCTGGTCTCAATAAGGCCTGGAAATGGTCTCCGCTTCAAAGCAGACCTCTGTATTTGGGGAGTGGGGT 

GGGGAGGAAATGAAAGGGCTGTGGGAGGAATGCCTGTGAGTCGTGGGTGGGAATCAGTCCAGGGGGCTTCGAGGG 

GGGTCCGAGGGAAAGGCTGCGGGGGTCGTCTGCAGATGCTGGGAGCTCAGTCCTGGGGAAGGAGGATGTGGGCGG 

GGGTCCTCTCCCTCCCACGAGTGCCACCGCTCCCCACCCCATCCTGGCTGGCTCCAAGTTGCCATGGAGACCACT 

CACACCGGGGCAACGTAATTTCCAGGGAACGTTCCGAATTTCTGGGTCCTCCGGGAGACACCCACAAACCAGCCA 

TCGTCACACTGCTGCATGACATCCACCCTGTCCCCCTCGCGCAGCTCCGCTCGTCTTCGTTCTGGGGCCTGTACT 
GGT 
SEQ ID NO:514

>GL3_37D_1_M13R Direction: sense

CTTGGGGTCTGGTCTCAATAAGGCCTGGAAATGGTCTCCGCTTCAAAGCAGACCTCTGTATTTGGGGAGTGGGGT 
GGGGAGGAAATGAAAGGGCTGTGGGAGGAATGCCTGTGAGTCGTGGGTGGGAATCAGTCCAGGGGGCTTCGAGGG 
GGGTCCGAGGGAAAGGCTGCGGGGGTCGTCTGCAGATGCTGGGAGCTCAGGTCCTGGGGAAGGAGGATGTGGGCG
GGGGTCCTCTCCCTCCCACGAGTGCCACCGTCCCCACCCATCCTGGCTGGCTCCAAGTTGCCATGGAGACCACTC 
ACACCGGGGCAACGTAATTTCCAGGGAACGTTCCGAATTTCTGGGTCCTCCGGGAGACACCCACAAACCAGCCAT 
CGTCACACTGCTGCATGACATCCACCCTGTCCCCCTCGCGCAGCTCCAGCTCGTCTTCGTTCTGGGGCCTGTACT 
GGTCCGCAGAAAACCCAGCACAGTGGTTAGATACGATAAAGCGGACCGCTCGACTAGTCTGAGGTCTGATACTCA 
ACTGGACTGTCGTAATGGGGCGAATTCTTGGCAAGATTTTCCATCACACTTGCGGGGCGTCGAAGCATGCAT

SEQ ID NO:515

>GL3_37D_2_M13R Direction: anti-sense

ACCAGTACAGGCCCCAGAACGAAGACGAGCTGGAGCTGCGCGAGGGGGACAGGGTGGATGTCATGCAGCAGTGTG 

ACGATGGCTGGTTTGTGGGTGTCTCCCGGAGGACCCAGAAATTCGGAACGTTCCCTGGAAATTACGTTGCCCCGG

TGTGAGTGGTCTCCATGGCAACTTGGAGCCAGCCAGGATGGGGTGGGGAGCGGTGGCACTCGTGGGAGGGAGAGG 

ACCCCCGCCACATTCTCCTTTCCCCAGGACCTGAGCTCCCAGCATCTGCAGACGAACCCCGGAAGCTTTCCCTCG 
GA 

SEQ ID NO:516

>GL3_37D_3_M13R Direction: N/A

CTTGGGGTCTGGTCTCAATAAGGCCTGGAAATGGTCTCCGCTTTCAAAGCAGACCTCTTGTATTTGGGGAGTGGG 
GTGGAGGAGGAAATGAAAGGGCTGTGGGAGGAATGCCTGTTGAGATCGGTGGGTGGGAATCAGTCCAGGGGGCTT 
CGAGGGGGGTCCGAGGGAAAGGCTGCGGGGGTCGTCTGCAGATCCTGGGAGCCTCAGGTCCTGGGGAAGGAGGAT 


GTGGGCGGGGGTCCTCTCCCTCCCACGAGTGCCACCGGTCCCCACCCCATCCTGGCTGGCTCCAAGTTGCCATGG 
AGACCACTCACACCGGGGCAACGTAATTTCCAGGGAACGTTCCGAATTTCTGGGTCCTCCGGGAGAACACCCACA 
AACCAGCCATCGTCACACTGCTGCATGACATCCACCCTGTCCCCCTCGCGCAGCTCCCAGCTCGTCTCCGTNCTG 
GGGCCTGTACTGGTC 

SEQ ID NO:517

>GL3_37D_4_M13R Direction: sense 

CTTGGGGTCTGGTCTCAATAAGGCCTGGAAATGGTCTCCGCTTCAAAGCAGACCTCTGTATTTGGGGAGTGGGGT 

GGGGAGGAAATGAAAGGGCTGTGGGAGGAATGCCTGTGAGTCGTGGGTGGGAATCAGTCCAGGGGGCTTCGAGGG 

GGGTCCGAGGGAAAGGCTGCGGGGGTCGTCTGCAGATGCTTGGGAGCTCAGGTCCTGGGGAAGGGGGATGTGGGC 

GGGGGTCCTCTCCCTCCCACGAGTGCCACCGTCCCCAACCCATCCTGGCTGGCTCCAAGTTGCCATGGAGACCAC

TC^CACCGGGGCAACGTAATTTCCAGGGAACGTTCCGAATTTCTGGGTCCTCCGGGAGACACCCACAAACCAGCC 

ATCGTCGCACTGCTGCATGACATCCACCCTGTTCCCCTCGCGCAGTTCCAGCTCGTCTTACGTTCTGGGGCCTGT 
AACTGGGT 

SEQ ID NO:518

>GL3_37D_PCR_G3F1 Direction: sense

GCTTGGGGTCTGGTCTCAATAAGGCCTGGAAATGGTCTCCGCTTCAAAGCAGACCTCTGTATTTGGGGAGTGGGG 

TGGGGAGGAAATGAAAGGGCTGTGGGAGGAATGCCTGTGAGTCGTGGGTGGGAATCAGTCCAGGGGGCTTCGAGG 

GGGGTCCGAGGGAAAGGCTGCGGGGGTCGTCTGCAGATGCTGGGAGCTCAGTCCTGGGGAAGGAGGATGTGGGCG 

GGGGTCCTCTCCCTCCCACGAGTGCCACCGCTCCCCACCCCATCCTGGCTGGCTCCAGTTGCCATGAACCACTCA

CACCGGGGCAACGTAATTTCCAGGGAACGTTCCGAATTTCTGGGTCCTCCGGGAGACACCCACAAACCAGCCATC 

GTCACACTGCTGCATGACATCCACCCTGTCCCCCTCGCGCAGCTCCGCTCGTCTTCGTTCTGGGGCCTGTACTGG 

TCCGCAAACCCAGCACAGTGGTTAGATGATTAAGCGGCCGCTCGACTAGTCTGAGGTCTGATACTCACTGACTTT 
TTAA 


SEQ ID NO:519

>gi|19923334|ref|NM_005775.2| Homo sapiens vinexin beta (SH3-containing
adaptor molecule-1) (SCAM-1), mRNA

CGGACGCGTGGGCAGGCAGCAGCCGGGCAGGGATGCTCCTGCGCTCCCGGGCGGCCTCGGGCCCAGCCAC 
CTGCTCGCCGGGGAAGAGGACACGCAGAGGAGCAGCTGGCTTGCCCGGAGTCCTCCCACCTTGACCCAAG
CATGCAGGGCCCACCCCGCAGCCTCCGCGCTGGGCTCAGCCTGGACGACTTCATCCCTGGCCACCTCCAG 
TCCCACATAGGGTCTTCCTCCCGGGGGACACGGGTGCCCGTGATCCGGAATGGTGGCTCCAACACCCTTA 
ATTTCCAGTTCCACGACCCCGCGCCCAGGACTGTGTGCAATGGGGGCTACACACCAAGACGAGATGCTTC 
CCAGCACCCGGACCCTGCGTGGTATCAGACCTGGCCAGGCCCTGGGAGCAAGCCCTCTGCAAGCACAAAG
ATCCCTGCCTCCCAGCACACCCAGAACTGGTCAGCCACGTGGACCAAGGACAGCAAGCGTCGGGACAAGC
GCTGGGTCAAGTACGAGGGAATCGGGCCCGTGGACGAGAGCGGCATGCCCATTGCCCCCCGATCCAGCGT
TGACAGACCCAGAGACTGGTACCGGAGAATGTTCCAGCAGATTCACCGGAAAATGCCAGACTTGCAGCTG 
GACTGGACCTTCGAGGAGCCACCCAGAGACCCCAGGCATCTAGGAGCCCAGCAAAGACCTGCCCACAGGC 
CCGGCCCGGCAACATCTTCCAGTGGAAGAAGCTGGGACCACTCTGAAGAGTTACCTAGAAGCACCTTCAA 
CTACAGACCTGGAGCATTCTCCACTGTGCTGCAGCCCTCAAATCAGGTGCTCAGACGCCGGGAAAAAGTA
GACAATGTCTGGACGGAAGAGTCCTGGAACCAGTTTCTGCAGGAACTAGAGACTGGGCAGAGGCCCAAGA 
AACCGCTGGTGGACGACCCTGGTGAGAAGCCCTCCCAGCCCATTGAGGTGCTGCTGGAGAGAGAGCTGGC 
CGAGCTGAGCGCCGAGCTGGACAAGGACCTGCGGGCAATTGAGACCCGACTGCCGTCCCCCAAGAGCTCG 
CCGGCGCCCCGACGGGCCCCGGAGCAGCGGCCCCCGGCCGGCCCGGCCTCAGCCTGGAGCTCCAGCTACC 

CACATGCACCTTACCTGGGTTCCGCCCGGTCCCTGAGTCCCCACAAAATGGCTGATGGAGGAAGCCCCTT
CCTAGGTCGGAGGGACTTTGTCTACCCTTCCTCAACCCGAGACCCTAGTGCCTCTAACGGAGGGGGCAGC 
CCAGCCAGGAGGGAAGAGAAGAAGAGAAAGGCCGCCAGGCTCAAGTTTGACTTCCAGGCGCAGTCCCCCA 
AGGAGCTGACTCTGCAGAAGGGTGACATTGTCTACATCCACAAGGAGGTGGACAAGAACTGGCTGGAGGG 
AGAGCACCACGGCCGCCTGGGCATCTTCCCTGCTAATTATGTGGAGGTGCTGCCCGCAGATGAGATCCCT 

AAGCCCATCAAGCCCCCGACCTACCAGGTGCTGGAGTATGGAGAGGCTGTGGCCCAGTACACCTTCAAGG
GGGACCTGGAGGTGGAGCTGTCCTTCCGCAAGGGAGAGCACATCTGCCTGATCCGCAAGGTGAACGAGAA 
CTGGTACGAGGGACGCATCACGGGCACGGGGCGCCAAGGCATATTCCCTGCCAGCTACGTGCAGGTGTCT 
CGTGAACCCCGGCTCCGGCTCTGTGACGACGGCCCCCAGCTCCCCACGTCTCCCCGCCTGACCGCTGCCG 
CCCGCTCAGCCCGTCACCCCAGCTCCCCCTCAGCCCTGCGCAGCCCAGCTGACCCCACCGACTTGGGGGG 

ACAGACCTCCCCCCGTCGCACTGGCTTCTCCTTCCCCACCCAGGAGCCTAGACCCCAGACCCAGAATCTT
GGCACCCCTGGTCCAGCTCTGTCCCACTCTCGAGGTCCCAGCCATCCCCTGGACCTGGGGACCTCCTCTC 
CTAACACCTCTCAGATACACTGGACCCCGTACCGGGCGATGTACCAGTACAGGCCCCAGAACGAAGACGA 
GCTGGAGCTGCGCGAGGGGGACAGGGTGGATGTCATGCAGCAGTGTGACGATGGCTGGTTTGTGGGTGTC 
TCCCGGAGGACCCAGAAATTCGGAACGTTCCCTGGAAATTACGTTGCCCCGGTGTGAGTGGTCTCCATGG 


CAACTTGGAGCCAGCCAGGATGGGGTGGGGAGCGGTGGCACTCGTGGGAGGGAGAGGACCCCCGCCCACA 
TCCTCCTTCCCCAGGACCTGAGCTCCCAGCATCTGCAGACGACCCCCGCAGCATTTCCCTCGGACCCCCC 
TCGAAGCCCCCTGGACTGATTCCCACCCACGACTCACAGGCATTCCTCCCACAGCCCTTTCATTTCCTCC 
CCACCCCACTCCCCAAATACAGAGGTCTGCTTTGAAGCGGAGACCATTTCCAGGCCTTATTGAGACCAGA 
CCCCAAGTCCCCCACCCCCATCCTGCTCCAGCGTTTCCTCTAACAGGGACCAGCTCTCCGCTTTGCCCCC 

ACGGGGTTCCTCTAACCAGAACCAGCTTCCTAGCCTCGTAGAGACCAAAGGCCGCCCCCGCCTGCTGGGG
TTCCTCCCAGCACCCCAGCTTGCTGGCTGCCCTCTTTGCCTTCTGGCCTCCAGCTGGGTGTGGGGGGGCG 
GAGCAAGGCGGGGGACAGACGCAGCACCTTCTTAGCGATCTAGGCCTGGCAAGAGCTCTGGCCCCAAGGC 
CTCCTCTTCCCAGGGGCTGCCAAGTCCTGGCCCTGGCCCTGGCATATCACCCCGCACTGTGGGGCCAGGC 
ACCACTAGCCTGGCTCAAATATTCCCCAGGGAGACTGCTGTGTGCTGCCCGCCTGCCTGCTGGCTCTCCC 

CCAGCCCCACATCCCCTCTGGAAGAGAATGTAAAATAAA.CCTGGACACAAGGGCCTTCGTGGCCTCGA

SEQ ID NO:520

>gi|19923335|ref|NP_005766.2| vinexin beta (SH3-containing adaptor
molecule-1) [Homo sapiens]
MQGPPRSLRAGLSLDDFIPGHLQSHIGSSSRGTRVPVIRNGGSNTLNFQFHDPAPRTVCNGGYTPRRDAS
QHPDPAWYQTWPGPGSKPSASTKIPASQHTQNWSATWTKDSKRRDKRWVKYEGIGPVDESGMPIAPRSSV
DRPRDWYRRMFQQIHRKMPDLQLDWTFEEPPRDPRHLGAQQRPAHRPGPATSSSGRSWDHSEELPRSTFN 
YRPGAFST^QPSNQVLRRREKVDimn^ 

ELSAELDKDLRAIETRLPSPKSSPAPRRAPEQRPPAGPASAWSSSYPHAPYLGSARSLSPHKMADGGSPF
LGRRDFVYPSSTRDPSASNGGGSPARREEKKRKAARLKFDFQAQSPKELTL^

EHHGRLGIFPANYVEVLPADEIPKPIKPPTYQVLEYGEAVAQYTFKGDLEVELSFRKGEHICLIRKVNEN
WYEGRITGTGRQGIFPASYVQVSREPRLRLCDDGPQLPTSPRLTAAARSARHPSSPSALRSPADPTDLGG
QTSPRRTGFSFPTQEPRPQTQNLGTPGPALSHSRGPSHPLDLGTSSPNTSQIHWTPYRAMYQYRPQNEDE
LELREGDRVDVMQQCDDGWFVGVSRRTQKFGTFPGNYVAPV


SEQ ID NO:521

NM_004265 (Homo sapiens fatty acid desaturase 2 (FADS2)) NM_004265.2
GI:14141180 

1 agggggcgcg gtgggaggag taggagaaga caaaagccga aagcgaagag ggcccgggct 
61 gcacacaccg gctgggaggc agccgtctgt gcagcgagca gccggcgcgg ggaggccgca

121 gtgcacgggg cgtcacagtc ggcaggcagc atggggaagg gagggaacca gggcgagggg 
181 gccgccgagc gcgaggtgtc ggtgcccacc ttcagctggg aggagattca gaagcataac 
241 ctgcgcaccg acaggtggct ggtcattgac cgcaaggttt acaacatcac caaatggtcc 
301 atccagcacc cggggggcca gcgggtcatc gggcactacg ctggagaaga tgcaacggat 
361 gccttccgcg ccttccaccc tgacctggaa ttcgtgggca agttcttgaa acccctgctg

421 attggtgaac tggccccgga ggagcccagc caggaccacg gcaagaactc aaagatcact 
481 gaggacttcc gggccctgag gaagacggct gaggacatga acctgttcaa gaccaaccac
541 gtgttcttcc tcctcctcct ggcccacatc atcgccctgg agagcattgc atggttcact 
601 gtcttttact ttggcaatgg ctggattcct accctcatca cggcctttgt ccttgctacc 
661 tctcaggccc aagctggatg gctgcaacat gattatggcc acctgtctgt ctacagaaaa 
721 cccaagtgga accaccttgt ccacaaattc gtcattggcc acttaaaggg tgcctctgcc

781 aactggtgga atcatcgcca cttccagcac cacgccaagc ctaacatctt ccacaaggat 
841 cccgatgtga acatgctgca cgtgtttgtt ctgggcgaat ggcagcccat cgagtacggc 
901 aagaagaagc tgaaatacct gccctacaat caccagcacg aatacttctt cctgattggg 
961 ccgccgctgc tcatccccat gtatttccag taccagatca tcatgaccat gatcgtccat 

1021 aagaactggg tggacctggc ctgggccgtc agctactaca tccggttctt catcacctac

1081 atccctttct acggcatcct gggagccctc cttttcctca acttcatcag gttcctggag 
1141 agccactggt ttgtgtgggt cacacagatg aatcacatcg tcatggagat tgaccaggag 
1201 gcctaccgtg actggttcag tagccagctg acagccacct gcaacgtgga gcagtccttc 
1261 ttcaacgact ggttcagtgg acaccttaac ttccagattg agcaccacct cttccccacc 

1321 atgccccggc acaacttaca caagatcgcc ccgctggtga agtctctatg tgccaagcat

1381 ggcattgaat accaggagaa gccgctactg agggccctgc tggacatcat caggtccctg
1441 aagaagtctg ggaagctgtg gctggacgcc taccttcaca aatgaagcca cagcccccgg 
1501 gacaccgtgg ggaaggggtg caggtggggt gatggccaga ggaatgatgg gcttttgttc 
1561 tgaggggtgt ccgagaggct ggtgtatgca ctgctcacgg accccatgtt ggatctttct 

1621 ccctttctcc tctccttttt ctcttcacat ctcccccata gcaccctgcc ctcatgggac

1681 ctgccctccc tcagccgtca gccatcagcc atggccctcc cagtgcctcc tagccccttc 
1741 ttccaaggag cagagaggtg gccaccgggg gtggctctgt cctacctcca ctctctgccc 
1801 ctaaagatgg gaggagacca gcggtccatg ggtctggcct gtgagtctcc ccttgcagcc 
1861 tggtcactag gcatcacccc cgctttggtt cttcagatgc tcttggggtt cataggggca 

1921 ggtcctagtc gggcagggcc cctgaccctc ccggcctggc ttcactctcc ctgacggctg

1981 ccattggtcc accctttcat agagaggcct gctttgttac aaagctcggg tctccctcct 
2041 gcagctcggt taagtacccg aggcctctct taagatgtcc agggccccag gcccgcgggc
2101 acagccagcc caaaccttgg gccctggaag agtcctccac cccatcacta gagtgctctg 
2161 accctgggct ttcacgggcc ccattccacc gcctccccaa cttgagcctg tgaccttggg 

2221 accaaagggg gagtccctcg tctcttgtga ctcagcagag gcagtggcca cgttcaggga

2281 ggggccggct ggcctggagg ctcagcccac cctccagctt ttcctcaggg tgtcctgagg 
2341 tccaagattc tggagcaatc tgacccttct ccaaaggctc tgttatcagc tgggcagtgc 
2401 cagccaatcc ctggccattt ggccccaggg gacgtgggcc ctgcaggctg caggagggca
2461 ctggagctgg gaggtctcgt cccagccctc cccatctcgg ggctgctgtg tggacggcgc 

2521 tgcctcaggc actctcctgt ctgaacctgc ccttactgtg tttaacctgt tgctccagga

2581 tgcattctga taggaggggg cggcagggct gggccttgtg acaatctgcc tttcaccaca 
2641 tggccttgcc tcggtggccc tgactgtcag ggagggccag ggaggcagag cgggagggag 
2701 tctcaggagg aggctgccct gaggggctgg ggagggggta cctcatgagg accagggtgg 
2761 agctgagaag aggaggaggt gggggctgga ggtgctggta gctgagggga cgggcaagtg 

2821 agaggggagg gagggaagtc ctgggaggat cctgagctgc tgttgcagtc taacccacta

2881 atcagttctt agattcaggg gaagggcagg caccaacaac tcagaatggg ggctttcggg 
2941 gagggcgcct agtcccccca gctctaagca gccaggaggg acctgcatct aagcatctgg 
3001 gttgccatgg caatggcatg ccccccagct actgtatgcc cccgaccccc gcagaggcag 
3061 aatgaaccca tagggagctg atcgtaatgt ttatcatgtt acttccccac ccctacattt 

3121 tttgaaataa aataaggaat tttattctc

SEQ ID NO:522 

NP_004256 (fatty acid desaturase 2; linoleoyl-CoA desaturase (delta-6-
desaturase)-like 2; delta-6 fatty acid desaturase) [Homo
sapiens]. NP_004256.1 GI:4758334

1 mgkggnqgeg aaerevsvpt fsweeiqkhn lrtdrwlvid rkvynitkws iqhpggqrvi
61 ghyagedatd afrafhpdle fvgkflkpll igelapeeps qdhgknskit edfralrkta 
121 edmnlfktnh vffllllahi ialesiawft vfyfgngwip tlitafvlat sqaqagwlqh 
181 dyghlsvyrk pkwnhlvhkf vighlkgasa nwwnhrhfqh hakpnifhkd pdvnmlhvfv 
241 lgewqpieyg kkklkylpyn hqheyfflig ppllipmyfq yqiimtmivh knwvdlawav

301 syyirffity ipfygilgal lflnfirfle shwfvwvtqm nhivmeidqe ayrdwfssql 
361 tatcnveqsf fndwfsghln fqiehhlfpt mprhnlhkia plvkslcakh gieyqekpll 
421 ralldiirsl kksgklwlda ylhk 
SEQ ID NO:523

AF134404 (Homo sapiens delta-6 fatty acid desaturase (CYB5RP) mRNA),
AF134404.1 GI:4868365

1 cttcgcttcc ctcggggtct tgctcggacc tcggccaccg cctgggatcc ccaggactcg 
61 tgcgtgcagc atgggcggcg tcggggagcc gggaccgcgg gagggacccg cgcagccggg

121 ggcaccgctg cccaccttct gctgggagca gatccgcgcg cacgaccagc ccggcgacaa
181 gtggctggtc atcgagcgcc gcgtctacga catcagccgc tgggcacagc ggcacccagg
241 gggcagccgc ctcatcggcc accacggcgc tgaggacgcc acggatgcct tccgtgcctt 

301 ccatcaagat ctcaattttg tgcgcaagtt cctacagccc ctgttgattg gagagctggc
361 tccggaagaa cccagccagg atggacccct gaatgcgcag ctggtcgagg acttccgagc

421 cctgcaccag gcagccgagg acatgaagct gtttgatgcc agtcccacct tctttgcttt 

481 cctactgggc cacatcctgg ccatggaggt gctggcctgg ctccttatct acctcctggg
541 tcctggctgg gtgcccagtg ccctggccgc cttcatcctg gccatctctc aggctcagtc 
601 ctggtgtctg cagcatgacc tgggccatgc ctccatcttc aagaagtcct ggtggaacca 

661 cgtggcccag aagttcgtga tggggcagct aaagggcttc tccgcccact ggtggaactt

721 ccgccacttc cagcaccacg ccaagcccaa catcttccac aaagacccag acgtgacggt 
781 ggcgcccgtc ttcctcctgg gggagtcatc cgtcgagtat ggcaagaaga aacgcagata
841 cctaccctac aaccagcagc acctgtactt cttcctgatc ggcccgccgc tgctcaccct 
901 ggtgaacttt gaagtggaaa atctggcgta catgctggtg tgcatgcagt gggcggattt 
961 gctctgggcc gccagcttct atgcccgctt cttcttatcc tacctcccct tctacggcgt

1021 ccctggggtg ctgctcttct ttgttgctgt cagggtcctg gaaagccact ggttcgtgtg 
1081 gatcacacag atgaaccaca tccccaagga gatcggccac gagaagcacc gggactgggt 
1141 cagctctcag ctggcagcca cctgcaacgt ggagccctca cttttcacca actggttcag 

1201 cgggcacctc aacttccaga tcgagcacca cctcttcccc aggatgccga gacacaacta
1261 cagccgggtg gccccgctgg tcaagtcgct gtgtgccaag cacggcctca gctacgaagt

1321 gaagcccttc ctcaccgcgc tggtggacat cgtcaggtcc ctgaagaagt ctggtgacat 

1381 ctggctggac gcctacctcc atcagtgaag gcaacaccca ggcgggcaga gaagggctca
1441 gggcaccagc aaccaagcca gcccccggcg ggatcgatac ccccacccct ccactggcca 
1501 gcctgggggt gcactgcctg ccctcctggt actgttgtct tcccctcggc cccctcacat 

1561 gtgtattcag cagccctatg gccttggctc tgggcctgat gggacagggg tagagggaag

1621 gtgagcatag cacattttcc tagagcgaga attgggggaa agctgttatt tttatattaa 
1681 aatacattca gatgtaaaaa aaaaa 

SEQ ID NO:524 

AAD31282 (delta-6 fatty acid desaturase) [Homo sapiens]. AAD31282.1
GI:4868366

1 mggvgepgpr egpaqpgapl ptfcweqira hdqpgdkwlv ierrvydisr waqrhpggsr 
61 lighhgaeda tdafrafhqd lnfvrkflqp lligelapee psqdgplnaq lvedfralhq 
121 aaedmklfda sptffafllg hilamevlaw lliyllgpgw vpsalaafil aisqaqswcl 
181 qhdlghasif kkswwnhvaq kfvmgqlkgf sahwwnfrhf qhhakpnifh kdpdvtvapv

241 fllgessvey gkkkrrylpy nqqhlyffli gpplltlvnf evenlaymlv cmqwadllwa 
301 asfyarffls ylpfygvpgv llffvavrvl eshwfvwitq mnhipkeigh ekhrdwvssq 
361 laatcnveps lftnwfsghl nfqiehhlfp rmprhnysrv aplvkslcak hglsyevkpf 
421 ltalvdivrs lkksgdiwld aylhq 
